
Gianfranco Capriz 
Paolo Maria Mariano 

Editors 



SPRINGER SCIENCE+BUSINESS MEDIA, LLC 






Modeling and Simulation in Science, Engineering and Technology 



Series Editor 
Nicola Bellomo 
Politecnico di Torino 
Italy 



HG. Cy^/ 77 eA(Mathematical Biology) 
Department of Mathematics 
University of Minnesota 
270A Vincent Hall 
Minneapolis, MN 55455, USA 

othmerQmath . mnn . edu 



Advisory Editoriai Board 

M. Avellaneda (Modeling in Economy) 

Courant Institute of Mathematical Sciences 

New York University 

251 Mercer Street 

New York, NY 10012, USA 

avellanedaQcims.nyu.edu 

K.J. Bathe (Solid Mechanics) 

Department of Mechanical Engineering 
Massachusetts Institute of Technology 
Cambridge, MA 02139, USA 

kjbOmit . edu 

R (Semiconductor & Transport Modeling) 
Mathematiques pour I'lndustrie et la Physique 
Universite P. Sabatier Toulouse 3 
118 Route de Narbonne 
31062 Toulouse Cedex, France 

degondOmip . ups-tlse . f r 

MA Herrero ( 75 /r/^ (Mathematical Methods) 
Departamento de Matematica Aplicada 
Universidad Complutense de Madrid 
Avenida Complutense s/n 
28040 Madrid, Spain 
herreroQsunma4 . mat . ucm . es 

W. A 5 fe/ 77 ^/ 7/7 (Stochastic Modeling) 

Department of Mathematics 
Iowa State University 
400 Carver Hall 
Ames, lA 50011, USA 
kliemeuinQ last ate . edu 



L /^/■( 9 z/b 5 /(lndustrial Mathematics) 

Dipartimento di Matematica 
Politecnico di Torino 
Corso Duca degli Abruzzi 24 
10129 Torino, Italy 
preziosiQpolito . it 

V. Protopopescu(Oom^Q\\Xm Systems, Epistemology) 
CSMD 

Oak Ridge National Laboratory 
Oak Ridge, TN 37831-6363, USA 

vvpQepmnas . epm . ornl . gov 

K.R. /?.^ 5 ^cy 7 j/(Multiphase Flows) 

Department of Mechanical Engineering 
A&M University 

College Station, TX 77843, USA 
KRa j agopalQmengr . tamu . edu 

Y Sone (Fluid Dynamics in Engineering Sciences) 

Professor Emeritus 

Kyoto University 

230-133 Iwakura-Nagatani-cho 

Sakyo-ku Kyoto 606-0026, Japan 

soneQyoshio . mbox . media . kyoto-u .ac.jp 




Advances in Multifield 
Theories for Continua 
with Substructure 



Gianfranco Capriz 
Paolo Maria Mariano 

Editors 



Springer Science+Business Media, LLC 




Gianfranco Capriz Paolo Maria Mariano 

Dipartimento di Matematica Dipartimento di Ingegneria Strutturale 

University di Pisa e Geotecnica 

56127 Pisa University di Roma "La Sapienza" 

Italy 00184 Roma 

Italy 

Library of Congress Cataloging-in-Publication Data 

Advances in multified theories for continua with substructure / Gianfranco Capriz, Paolo 
Maria Mariano, editors. 

p. cm. - (Modeling and simulation in science, engineering and technology) 

ISBN 978-1-4612-6465-1 ISBN 978-0-8176-8158-6 (eBook) 

DOI 10.1007/978-0-8176-8158-6 

1. Continuum mechanics. I. Capriz, G. (Gianfranco) II. Mariano, Paolo Maria, 1966- 
III. Series. 

QA808.2.A385 2003 

531^c21 2003052406 

CIP 



AMS Subject Classifications: 74A10, 74A15, 74A30, 74A40, 74A35, 74A45, 74A50, 74A60, 
74C15, 74N05, 74N15, 74N20, 74N30, 74R15, 74E25, 74E30, 74H33, 76F25, 76A05 



Printed on acid-free paper. ® 

© 2004 Springer Science+Business Media New York ^ 

Originally published by Birkhauser Boston in 2004 
Softcover reprint of the hardcover 1st edition 2004 

All rights reserved. This work may not be translated or copied in whole or in part without the written 
permission of the publisher (Springer Science+Business Media, LLC ), except for brief excerpts in 
connection with reviews or scholarly analysis. Use in connection with any form of information storage 
and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now 
known or hereafter developed is forbidden. 

The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the 
former are not especially identified, is not to be taken as a sign that such names, as understood by the 
Trade Marks and Merchandise Marks Act, may accordingly be used freely by anyone. 



ISBN 978-1-4612-6465-1 SPIN 10920787 



987654321 






Contents 



Preface vii 

1. Work Conditions and Energy Functions for Ideal Elastic- Plastic Materials 

M. Silhavy 1 

2. Generalized Elastic-Plastic Decomposition in Defective Crystals 

G R Parry 33 

3. Cocycles, Compatibility, and Poisson Brackets for Complex Fluids 

H. Cendra, J. Marsden, and T. S. Ratiu 51 

4. Defect-Induced Transitions as Mechanisms of Plasticity and Failure 
in Multifield Continua 

O. B. Naimark 75 

5. Extended Thermodynamics: A Multifield Theory Par Excellence 

I. Muller 115 

6. Microstructure and Turbulence in Dilute Polymer Solutions 

E. De Angelis, C. M. Casciola, P. M, Mariano, andR. Piva 127 

7. Fluxes and Flux-Conjugate Stresses 

R. Segev 149 

8. Algebra, Geometry, and Computations of Exact Relations for Effective 
Moduli of Composites 

Y. Grohovsky 167 

9. A Multifield Theory for the Modeling of the Macroscopic Behavior 
of Shape Memory Materials 

D. Bernardini and T. J. Pence 199 

10. Balance at a Junction among Coherent Interfaces in Materials 
with Substructure 
G. Capriz and P M. Mariano 



243 




Preface 



To achieve design, implementation, and servicing of complex systems and struc- 
tures in an efficient and cost-effective way, a deeper knowledge and understanding of 
the subtle cast and detailed evolution of materials is needed. The analysis in demand 
borders with the molecular and atomic one, spanning all the way down from classical 
continua. The study of the behavior of complex materials in sophisticated devices 
also opens intricate questions about the applicability of primary axioms of continuum 
mechanics such as the ultimate nature of the material element itself and the possibility 
of identifying it perfectly. So it is necessary to develop tools that allow us to formulate 
both theoretical models and methods of numerical approximation for the analysis of 
material substructures. 

Multifield theories in continuum mechanics, which bridge classical materials 
science and modem continuum mechanics, provide precisely these tools. Multifield 
theories not only address problems of material substructures, but also encompass 
well-recognized approaches to the study of soft condensed matter and allow one to 
model disparate conditions in various states of matter. However, research in multifield 
theories is vast, and there is little in the way of a comprehensive distillation of the 
subject from an engineer’s perspective. Therefore, the papers in the present volume, 
which grew out of our experience as editors for an engineering journal,^ tackle some 
fundamental questions, suggest solutions of concrete problems, and strive to interpret 
a host of experimental evidence. In this spirit, each of the authors has contributed 
original results having in mind their wider applicability. 

In Chapter 1, M. Silhavy deals with the energetics of ideal elastic-plastic crys- 
talline materials under large deformations, showing that the convexification of a 
dissipation function (introduced to describe the set of possible free energies) pro- 
vides the optimal lower bound of the plastic work. New subtle consequences ensue 
from the celebrated Il’yushin condition: for example, the existence of a new kind of 
energy function and the validity of a condition stronger than the common normality 
condition. 

In Chapter 2, G. Parry also investigates crystal plasticity: When defective crys- 
tals are described by lattice vectors, one is able to deduce elastic invariants under 
deformation paths. An unusual but physically significant decomposition of the de- 
formation gradient into elastic and plastic parts is obtained by purely geometrical 
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arguments, without resorting to the notion of stress; thus, issues in kinematics are 
discerned from issues associated with interactions. The latter ones are, in fact, power 
conjugated with the former and are, in this sense, subsequent to them. 

Motivated by questions that arise in the mechanics of complex fluids, such 
as spin glasses, H. Cendra, J. E. Marsden, and T. S. Ratiu introduce, in Chapter 3, 
in the general context, related Poisson brackets with cocycles. Possible connections 
with the variational multisymplectic approach are suggested, with implications for 
further research. Thus the subtle and geometrical nature of mechanics, in particular 
mechanics of complex bodies, is emphasized. 

In Chapter 4, O. Nairmark discusses physical interpretations of a wide number 
of detailed experimental results on defect-induced transitions and studies their conse- 
quences for the macroscopic irreversible mechanisms of metal plasticity and failure. 
Dynamic phenomena involving solitary waves are characterized by their connection 
with the essential features of the defect substructure of bodies under experimental 
examination. Here as well, the data and the qualitative behavior lead to additional 
theoretical questions. 

Even simple substances, such as monoatomic gases, sometimes display complex 
behavior. When a monoatomic gas is rarefied, the Navier-Stokes-Fourier scheme 
does not hold satisfactorily. Extended thermodynamics become a necessary tool, and 
hundreds or thousands of moments of the distribution function enter the picture. In 
Chapter 5, 1. Muller examines this issue, provides an analysis of the scattering of light 
in a gas, and investigates the shock structure in a shock tube. 

When turbulence is present, drag reduction is observed in polymeric fluids as 
a consequence of their macromolecular substructure. When the polymeric flow be- 
comes semidense, a multifield approach can be used to show that the terms involving 
spatial gradients (in particular Laplacians), required in the balance equations, are con- 
sequences of microstress, that is, of contact interactions between neighboring fami- 
lies of polymeric chains. This matter is discussed by E. De Angelis, C. M. Casciola, 
P. M. Mariano, and R. Piva in Chapter 6. 

The choice of the manifold M of substructural states can be, at times, quite 
unusual. Nevertheless the material substructure can always be characterized by an 
abstract, extensive property. The notion of stress still makes sense and can be achieved 
without the notion of “velocity.” The material elements are considered as integral 
manifolds of a flux vector field. Such a point of view is discussed in Chapter 7 by 
R. Segev, and the results are illustrated by an application to the case in which the 
extensive property is the electric charge. 

Composites are natural examples of complex materials, and they may be con- 
structed in a number of different ways. For polycrystals, one possible method of 
analysis is to deduce “effective” elastic constants by homogenization. However, in 
general, there is no guarantee that the tensors of material properties be symmetric. 
Nevertheless, exact relations among material constants can be obtained which are 
preserved by the homogenization procedure. They give us information on the re- 
sulting composite, regardless of the complexity of the material texture. This issue is 
discussed by Y. Grabovsky in Chapter 8. 
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Shape memory behavior arises in the study of materials undergoing solid-to- 
solid transformations between austenite and martensite phases. Their exotic behavior 
is a natural proving ground for the potential of multifield theories. In Chapter 9, 
D. Bemardini and T. J. Pence consider three morphological descriptors of the sub- 
structural state: Two are of a scalar nature, and the third is a second-order tensor. This 
approach accounts not only for pseudoelasticity and shape memory behavior, but also 
for the high temperature reorientation of martensitic variants. 

Interfaces and junctions are common in complex materials. Often interfaces are 
characterized by their own surface energy, and an additional energy accrues along 
junctions. The surface energy must be positive, while, at the junction, there may 
occur a change of sign. The modeling of sharp interfaces and junctions becomes more 
intricate when one views them as structured surfaces or lines. Surface microstresses 
and self-forces, as well as line stresses and self-forces, enter the picture and need to 
be balanced. The matter is discussed in Chapter 10 by G. Capriz and P. M. Mariano. 

This collection of invited papers had its origins at Taormina, Italy, during the 
2001 International Congress of the Italian Association for Theoretical and Applied 
Mechanics (AIMETA); it was there that many of the authors discussed with us the 
basic organization of this text. We are grateful to the organizers of the Congress and 
to the Italian National Group of Mathematical Physics for the grants that made our 
work possible. Furthermore, we express our appreciation to Tom Grasso and his staff 
at Birkhauser Boston for providing kind support. Last but not least, we thank Nicola 
Bellomo for accepting our book into his series Modeling and Simulation in Science, 
Engineering and Technology. 



Gianfranco Capriz 
Paolo Maria Mariano 
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1. Work Conditions and Energy Functions for 
Ideal Elastic-Plastic Materials 

M. Silhavy* 



Abstract. The paper deals with the cyclic second law and Il’yushin’s condition 
for isothermal, ideal, isotropic, elastic-plastic materials at large deformations. The 
second law is equivalent to the existence of the elastic potential and the nonnega- 
tivity of plastic power. The material admits infinitely many free energies: The set 
of all energy functions is described in terms of a dissipation function. Its convex- 
ification provides the optimal lower bound for plastic work; it also figures in the 
maximal and minimal energies. H’yushin’s condition is equivalent to the existence 
of the elastic potential and a new condition that is stronger than the normality of 
the plastic stretching and the convexity of the stress range. Il’yushin’s condition 
is also equivalent to the existence of a new kind of energy function called “the 
initial and final extended energy functions.” Materials of type C are introduced 
for which the initial extended energy function has additional convexity properties. 
It can be viewed as a stored energy of a Hencky hyperelastic material associated 
with the elastic-plastic material. 



1.1. Introduction 

This paper deals with the cyclic second law and Il’yushin’s condition for isothermal, 
ideal, isotropic elastic-plastic materials at large deformations. The ideal nature of the 
material excludes hardening. Both the isothermal cyclic second law and Il’yushin’s 
condition assert the nonnegativity of the work of external forces on for certain classes 
of cyclic processes. The former speaks of cycles in the state space; the latter of cycles 
in the space of total deformations. 

Sections 1.3-1. 5 analyze the material from the point of view of the general 
theory of actions and potentials on thermodynamical systems by Coleman and Owen 
[5]. That theory provides an unbiased derivation of potentials like the free energy; 
moreover, it enables one to discuss their uniqueness/nonuniqueness and to determine 
the maximal and minimal potentials. For materials treated here, the plastic flow rule 
can be “nonassociated,” and the restrictions equivalent to the second law are the 
existence of the elastic potential for the elastic stress function and the nonnegativity 

* Mathematical Institute of the AV CR, &tna 25, 115 67 Prague, Czech Republic, silhavy@math. 
cas.cz and Department of Mathematics, University of Pisa, Via F. Buonarroti 2, 56127 Pisa, Italy, 
silhavy@dm.unipi . it. 
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of the plastic power in every process. Using that, and defining the free energy as any 
state function satisfying the dissipation inequality, one finds that the elastic potential is 
a possible free energy. However, the free energies are highly nonunique here: Infinitely 
many of them differ mutually by a nonconstant state function. A complete description 
of the set of all free energy functions is given using a dissipation function that is 
closely related to the plastic power. The central result of this part says that the optimal 
lower bound for the plastic work is the convexification of the dissipation function. The 
maximal/minimal free energies are the elastic potential plus/minus the convexification 
of the dissipation function at the logarithmic plastic stretching. Analyses of elastic- 
plastic materials from the standpoint of [5] have been given previously by Coleman 
and Owen [6], [7] for unidimensional infinitesimal ideal materials, by Lucchesi [14] 
for three-dimensional infinitesimal materials with hardening, and by Lucchesi and 
Silhavy [17] for large deformation three-dimensional infinitesimal materials with 
hardening. 

In the context of infinitesimal deformation theory, ITyushin’s condition is clas- 
sically known to imply the convexity of the stress range (CSR) (the stress range is 
the region below the yield surface) and the normality rule (NR) asserting that during 
loading, the plastic stretching is proportional to the exterior normal to the yield surface 
(e.g., [25]). The analysis has been extended to large deformations by Lucchesi and 
Podio-Guidugli [15] (see also [16] and [10]). Their analysis showed that ITyushin’s 
condition still implies the CSR and NR provided that the stress range is interpreted 
as the set of elastically reachable Kirchhoff stresses (as opposed to Cauchy stresses; 
see Section 1.2 for definition); in addition, it implies the existence of the elastic po- 
tential (EP). However, their analysis shows that conversely EP, CSR, and NR lead to 
Il’yushin’s condition only for deformation cycles that are small in a precisely defined 
sense. The main feature of the analysis in Sections 1.6-1 .7, and what distinguishes 
it from the previous work, is the possibility of treating arbitrarily large deformation 
cycles. Namely, ITyushin’s condition is shown to be equivalent to EP and a new 
Condition E (Condition (ii) of Theorem 2). Condition E is stronger than NR and 
CSR. On the basis of Il’yushin’s condition for large cycles, new energy functions 
are constructed satisfying dissipation inequalities stronger than those based on the 
second law. Their main feature is that they are nonlocal in time (have no localized 
counterparts). There are two types of the new energy functions; I call these the initial 
and final (extended) energy functions. 

In Sections 1.9-1.10 materials called materials of type C are studied that per- 
mit a more explicit analysis. The main requirement is a restricted convexity in the 
logarithmic deformation on symmetric deformation gradients of fixed determinant. 
Materials of type C satisfy IPyushin’s condition and the initial extended energy admits 
a description in terms of convex-conjugated functions in the logarithmic deformation. 
The initial extended energy can be interpreted as a stored energy of some hyperelastic 
material; this material is the large deformation analogue of the hyperelastic material 
in the infinitesimal Hencky theory of plasticity. Its stress relation coincides with the 
nonlinear elastic response of the plastic material for deformation gradients from the 
original elastic range; if the deformation is outside the elastic range, then the stress 
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is determined by some nonlinear projection onto the elastic range (in the space of the 
logarithmic deformations; see Section 1.9 for a precise description). Unexpectedly, 
the extended energy grows logarithmically at large deformations, in contrast to the 
infinitesimal Hencky theory, where the growth is linear (cf. Temam [31]). As an il- 
lustration, an elastic-plastic material of type C is considered for which the extended 
energy and all other derived objects can be calculated explicitly. 

The above description makes it clear that the logarithmic deformation and loga- 
rithmic convexity/concavity emerge naturally from the analysis. The reason is that for 
large deformations theory, the processes of minimal dissipation are those for which 
the logarithm of plastic deformation, and not the plastic deformation itself, is linear 
in time. 



1.2. Ideal elastic-plastic materials 

Let Lin denote the set of all second-order tensors on a three-dimensional space; we 
u se the scalar product A • B = tr(AB^) on Lin and the euclidean norm |A| := 
\/A • A. The deformation gradients are interpreted as the elements of the set Lin"^ 
of all second-order tensors with positive determinant. Sym^ is the set of all positive 
definite symmetric tensors. 

For the considerations of the paper, it is convenient to describe the response in 
terms of states and processes (cf. [20], [5], [6], [7], [28], [16], [29], [2], [3], [24]). 
Hie state space E is 

5} = {a = (E, P) : E € f , P 6 Unim] = 5 x Unim, 

where £ c Lin"^ and Unim is the set of all tensors with determinant 1. The set £ is 
closely related to the elastic range (see below). The states a are pairs (E, P) where 
E G is the elastic deformation and P is an unimodular plastic deformation. The 
total deformation and the Kirchhoff stress of a = (E, P) are 

F(a) := EP, T(cr) := T(E), (1) 

where T : 5 -> Sym is a given function and Sym is the set of all symmetric tensors. 
Equation (1) expresses the multiplicative decomposition of the deformation gradient 
into the elastic and plastic parts [13]. The Kirchhoff stress T := (detF)Tc is more 
convenient than the Cauchy stress Tc here. Let S : 5 Lino denote the mapping 
defined by 

S(E) = E^f (E)E“^ - i(trT(E))l, 

where Lino is the set of all traceless tensors. The definition is motivated by the 
expression for the plastic work (4) below. The stress range S is defined by 

5 := S(£). 

To specify the class of processes 11 , we assume that the plastic deformation changes 
only when E is on the boundary d£ of £ and that the direction of the plastic stretching 
is determined by a prescribed function M mapping 95 into the set of all traceless 
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symmetric tensors SymQ, normalized by |M(E)| = 1, E € 35. Thus we assume the 
flow rule of the form 



BP(t) = a(OM(E(r)), 



( 2 ) 



where the plastic stretching D^(/) is the symmetric part of L^(t) = P(r)P(0~^ 
and a(t) > 0 is a coefficient of proportionality. Formally, FI consists of all functions 
7T = (E( ), P( )) mapping closed intervals of the type [0, djt], dj^ > 0 into E that are 
continuous and piecewise continuously differentiable and that satisfy the following 
condition: For every t e [0, d^] for which D^(0 0, one has E(0 € d£ and (2) 

holds for some a{t) > 0. For convenience it is assumed that every process starts at 
time / = 0. The number dj^ can be different for different processes and it is called the 
duration of the process. The states = 7t(0) and = 7v(djt) are called the initial 
and final states of the process tt. We also write = (E\ P^, = (E^, P^), and 

= F(7t 0, F^ = F(7T^). The considerations in the paper require the construction of 
new processes from given ones via the operation of continuation. If tti , 7T2 € 0 are 
two processes with 7r[ = then the continuation n\ * 7T2 of it \ with 7T2, is 



TTi *;T2(0 = 



7Tl(0 

7t2(t-dn^) 



if r € [O.djr^], 
ift € Kp^TTi 



For every 3 > 0 let 



5^ = {A e 5 : det A = 3} . 



( 3 ) 



A set M C Lin is said to be objective or isotropic if 

RA e M, RAR^ e M, 



respectively, for every A € M and R € Rot. Here Rot is the proper orthogonal group. 
A function / : M R, M C Lin is said to be objective or isotropic if M is objective 
or isotropic, respectively, and 

/(RA) = /(A), /(RAR^) = /(A), 

respectively, for every A e M and R e Rot. A function G : M Lin is said to be 
objective or isotropic if Af is objective or isotropic, respectively, and 

G(RA) = RG(A)R^, G(RAR^) = RG(A)R^, 

respectively, for every A e M and R e Rot. G is said to be scalar-objective if M is 
objective and 

G(RA) = G(A) 

for every A e M and R € Rot. 



Definition 1. The objects 5, T, M are said to determine an ideal elastic-plastic ma- 
terial if they satisfy the following conditions: 

(i) £ is a closure of its interior, any two points of£ can be connected by a piecewise 
smooth curve in 5, and T is continuous. 

(ii) For every C 6 Sym^, \C\ = \,we have M(E) = Cfor at least one E 6 d£. 
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(iii) S is bounded and S maps the interior and the boundary of £ onto the interior 
and boundary of S, respectively. 

(iv) For each 5 > 0, the set £s is nonempty. 

(v) The set £ and the function T are objective and isotropic and the function M is 
scalar-objective and isotropic. 

The quadruple M = (S, 11, F, T) is called the ideal elastic-plastic material 
or, briefly, the material. The material just constructed is a special case of a material 
with elastic range (cf. [25], [22], [23], [27], [15], [16]). For processes starting at 
a = (E, P) and remaining in the elastic range £{cr) := £P the material behaves like 
an elastic material with the elastic response 

T*(ct,F) = t(FP-'), Fef(a). 

Item (v) of Definition 1 expresses the objectivity and isotropy of the material. 
The objectivity and isotropy of £ and T need not be commented; however, note that 
they imply that S is scalar-objective and isotropic; moreover, its values are sym- 
metric tensors. The scalar-objective nature of M is consistent with the following 
transformation rules under a change of frame and change of reference configuration: 
If 7T = (E(-), P(0) is a process, Q a piecewise continuously differentiable function 
on [0, dji] with values in Rot, and R a fixed element of Rot, then 

F h-> QF E i-> QE, P P, 

F i-> FR E i-> E, P PR. 

Other transformation laws are possible since the plastic deformation is determined 
only to within a rotation in isotropic materials [27]. 



1.3. The cyclic second law: Its first consequences 



The work of external forces on the material in the process tt = (E(-), P(-)) is 

rd 

w{tx) = / T • lidt, 






where L = FF ^ F = EP, d djt, and T = T(E) is the time evolution of the 
Kirchhoff stress in the process. Using the isotropy and symmetry of T we find that 

u;(7t) = w^{n) -F w^{n), 



where 









T • EE"^ dt. 



w^n) 






S-D^ dt 



(4) 



are the elastic and plastic works in tt, respectively, where S = S(E) is the time- 
evolution of the traceless part of the Kirchhoff stress. A process n is said to be a 
(7 -cycle if 7T^ = 7T^ . For isothermal materials the second law reduces to the following 
assertion [5]. 



The cyclic second law. For every a -cycle ;r, u;(7r) > 0. 
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A function p : S R is said to be an elastic potential for T if 

p(E^) - p(E') = w^{n) (5) 

for every process. It follows that if p exists, it is objective and isotropic and contin- 
uously differentiable in the interior of S with the stress relation T(E) = Dp(E)E^ 
prevailing there. 

Proposition 1. The material satisfies the cyclic second law if and only ifT has an 
elastic potential and 

S(E) • M(E) > 0 (6) 

for every E e d€. 

Inequality (6) says that the plastic power, and hence also the plastic work, is 
nonnegative: S • = a(t)S • M > 0. Let us emphasize that this consequence does 

not hold for general, nonideal, elastic-plastic materials. 

Proof Suppose that the material satisfies the cyclic second law. To prove the existence 
of the potential, consider the elastic process 7T = (E(-), 1), where E(-) : [0, 1] f is 
a path with values in f . If E(0) = E(l) then the cyclic second law asserts w{tx) >0. 
Applying the same to the time reversal tt we obtain w{7t) = —w{tx) > 0 and thus 
w{n) = 0. Since w{n) = the vector field T is path independent on £\ hence 

it has an elastic potential. Proof of (6): Let E e d£ and set D := M(E). Consider 
an orthonormal basis of eigenvectors of D so that D = diag(Ji, ^2, Let be 
the group of all 3 by 3 permutation matrices Z which we identify with orthogonal 
tensors Z in our basis. Enumerate the elements of P^ arbitrarily to obtain a sequence 
Za, O' = 1, . . . , n = 3!. Since 

Z«DZj = diag(J^^(i), t/a«(2), ^(Ja(3)), 

where is the permutation corresponding to Z«, the family {Z^DZ^, a = 1, . . . , «} 
is commutative. Using trD = 0, one finds that 

n 

J2^aBZl = 0. (7) 

a=l 

Define inductively the processes = (Ea( ), ?„(•)) of duration 1 by 
Ei(0 = Z,EZ[, Pi(0 = Zie°'Z[, 

andfora>l, 

E„(0 = Z„EZj, P„(0 = Z^cD'Zjpf 
t e [0, 1]. These are really processes and one finds that 0^(0 = Z^DZj and 

W^(7Ta) = S(E) • M(E). 

Let Pof = (Uof (•), P^)), O' = 1, . . . , n, be elastic processes such that 
uL = Z„EZ^, Uf=Z„+iEZ^+„ 



( 8 ) 
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if i < n, and 

U‘=Z„EZ[, Uf=ZiEZ[. 

One finds that the process tt = (E(-), P(-)) := tti * pi * ;t 2 * P2 * • * * Pn can be 

constructed from 

E‘ = ZiEZi, E^ = ZiEZi, 

and 

n 

P' = 1, Pf = exp[^ Z„DZ[] = 1. 

a=l 

Here we use (7) and the commutativity of the family {ZaDZj, a = 1, As 
also = E^, the process n is cyclic. Since w^(7t) = 0 by (5), w(7t) > 0 reduces, 
by (8), to 

u;^(7r)=nS(E)-M(E)>0 

and (6) follows. Conversely, if the two conditions of the theorem hold and tt is a 
cyclic process then w^{7t) = 0 by the existence of the potential and u;^(7r) > 0 by 
(6). Hence the cyclic second law holds. □ 

1.4. The free energy functions 

Any function : E ^ R satisfying 

t/r(7rf) - < Win) (9) 

for every process n is referred to as the free energy function. Using the nonnegativity 
of the plastic power, it is shown below that the elastic potential is one example of 
the free energy function, and a description is given of all free energies. The specific 
features of the material imply that any free energy function splits into a sum of the 
reversible elastic potential and a residual energy function that depends only on the 
plastic deformation. 

Proposition 2. Let the material satisfy the cyclic second law and let p be its elastic 
potential. A function : E -> R a free energy function if and only if 

t/r(a) = p(E) + r(P) (10) 

for every cr = (E, P) e E, where r : Unim ->• IR satisfies 

r(P^)-r(P') <w^in) (11) 

for every process tt = (E(-), P(*)) ^ LI. In particular, the function ^ : E — > R 
given by 

rlr(a) = p(E), (12) 

or = (E, P) G E is a free energy function. 
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Let us emphasize that a general (nonideal) elastic-plastic material need not have 
a free energy independent of plastic deformation. Every function r satisfying (11) in 
every process is called the residual energy function. The problem of describing all 
free energy functions reduces to that of describing all residual energy functions. 



Proof By Proposition 1, T has an elastic potential p and the plastic work is nonneg- 
ative in every process. If is a free energy function and P € Unim, then for every 
path E(-) : [0, 1] -> f the process n = (E(-), P) is elastic and as w{7t) = w^{ix) is 
given by (5), the dissipation inequality (9) reads 

V^(Ef,P)-Vr(E‘,P)>p(Ef)-p(E‘). (13) 

Replacing ix by its time-reversal tt we obtain the opposite inequality and hence the 
equality must hold in (13). It follows that for each P 6 Unim, V^(-, P) differs by a 
constant r(P) from p which leads to (10). Using (10), one sees that (9) reduces to 
(11). Reversing the direction of the arguments, one finds that every function yjr of the 
form (10) is a free energy function. Since the plastic work is nonnegative, the function 
r = 0 satisfies (11) and hence the given by (12) is a free energy function. □ 

To simplify notation, if A, B € Lin, we write 

A-B 



to mean that |A| = 1, B ^ 0, and A = B/|B|. Define the dissipation function 
m : Symo R by 

m(D) := inf {S(E) • D : E € 35, M(E) ~ D} 

if D 7 ^: 0 and m(0) := 0. Note that the infimum is taken over a nonempty set by 
Definition l(ii). If the material satisfies the cyclic second law, then m is nonegative 
by Proposition 1 . 

If r is a function on an open subset of Unim and P is in the domain of r, then 
r is said to have a total differential at P if the function A r(^-^P), A € Lino has 
a Frechet derivative at 0 in the sense of [8, Part I, Chapter VIII]. The differential 
(derivative) Dr(P) is then defined as the unique element satisfying tr(Dr(P)P^) = 0 
such that 



|r(K0) 



= Dr(P)-y(0) 
«=o 



for every smooth curve in Unim with y(0) = P. Note that Unim is a regular eight- 
dimensional surface in the nine-dimensional space Lin and that the unit normal and 
the tangent space to Unim at A are A“^/1A“^| and {M € Lin : tr(MA“*) = 0} . 
The Haar measure ([8, Part II, Chapter XIV and Part IV, Chapter XIX]) and the 
surface measures on Unim are mutually absolutely continuous and their images in 
any local coordinate chart on Unim are absolutely continuous with respect to the 
eight-dimensional Lebesgue measure in that coordinate chart. Hence there is a well- 
defined notion of almost everywhere. If r is a locally Lipschitz continuous function 
on Unim, Rademacher’s theorem ([19]) asserts the existence of the total differential 
for almost every P 6 Unim. Finally note that if Pi, P 2 € Unim then there exist a 
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unique pair R G Rot, D e Symo such that P 2 = R^^Pi . This follows from the polar 
decomposition = RU by writing U = D g Symo. 

Proposition 3. If the material satisfies the cyclic second law and r : Unim R is 
a function then the following three conditions are equivalent: 

(i) r is a residual energy; 

(ii) for every Pi , P 2 € Unim one has 

r(P 2 )-r(Pi) <m(D), (14) 

where D G SyniQ is determined by the condition that P 2 = R^^Pi for some 
R G Rot; 

(iii) r is locally Lipschitz continuous, objective, and for almost every P G Unim, 

Dr(P)P^ • D < m(D) for all D G SyniQ. (15) 



Proof (i) => (ii): Let (i) hold and let Pi , Pi , D, R be as in (ii). Let us prove that 



r(P 2 )-r(Pi)< 



0 

S(E) . D 



ifD = 0, 



(16) 



where E g Of is any element such that 



M(E) - D. (17) 

Let E be arbitrary if D = 0 or such that E G 9f and (17) holds if D 0. Let 
Q : [0, 1] -> Rot be any continuously differentiable function such that Q(0) = 1 and 
Q(l) = R. Define n = (E(-), P(*)) by 

E(0 = Q(0EQ(ff , P(0 = Q(f)e°'Pi, 

This is a process, D^(0 = Q(r)DQ(r)^, P^ = P 2 , P^ = Pi, and w^(n) = S(E) • 
D. Thus the residual dissipation inequality gives (16). The definition of m(D) then 
gives (14). 

(ii) => (iii): Assume that (ii) holds. Let us first prove that r is locally Lipschitz 
continuous. Recall that it is assumed (Definition l(iii)) S is bounded. Set 



Cl = max{|Sl : S G f } . 

By Definition l(ii), for every D G SymQ, D ^ 0, there exists an E g Of such that 
(17) holds. We then have 



S(E)-D <ci|D|. 

Thus for any Pi , P 2 € Unim we have 

r(P 2 )-r(Pi) <ci|D|, (18) 

where D satisfies P 2 = Rcl^Pi. Since this holds for any Pi, P 2 € Unim, we can 
interchange the roles of Pi , P 2 . One finds that D changes to D = — RDR^ and (18) 
for the interchanged pair provides r(Pi) — r(P 2 ) < ci |D|; hence 

|r(P 2 )-r(Pi)|<ci|D|. 



(19) 
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This inequality obviously implies that r is locally Lipschitz continuous. To prove that 
r is objective, it suffices to note that if P 6 Unim, Q e Rot, then for Pi := P, P 2 = 
QP, (19) reduces to r(QP) = r(P). We finally prove (15). Let P e Unim be such 
that Dr exists. The application of (14) to P 2 = ^®^P, Pi = P, / > 0, provides 

- r(P) < m(Dt) = ;m(D). 

Dividing by r, letting t tend to 0, and using the assumed existence of the total differ- 
ential of r at P gives (15). 

(iii) (i): Let r satisfy Condition (iii). Note first that a standard consequence 
of the objectivity asserts that Dr(P)P^ is symmetric for every P e Unim for which 
the total differential exists. Let us now prove (i); i.e., let us prove that the residual 
dissipation inequality holds for each process tt = (E(-), P(-)i We may assume that 
P(-) is continuously differentiable; otherwise we divide the process into subintervals 
where P( ) is continuously differentiable and apply the forthcoming considerations 
to each such piece of the process. We want to integrate (15) along the path P(-). Since 
the differential Dr exists and satisfies (15) only for almost every (a.e.) P e Unim, it 
may happen that Dr(P(f)) exists for no t e [0, d^]. Nevertheless, assume first that 
Dr(P(t)) exists for a.e. time t and complete the proof in that case first. At the end of 
the proof we shall employ some limiting procedure that reduces the general case to the 
above one. The function s := r(P(-)) is Lipschitz continuous and since s{t) is given 
by the chain rule for a.e. time by our assumption and Dr(P(t))P(t)^ is symmetric, 

i(f)=Dr(P(0)P(0^-D^(0<m(DP(r))<S(E(0)-DP(0 (20) 

for a.e. t € [0, here D^(-) is the plastic stretching of tt. We have also used 
equation (15) and the definition of m(D). The integration of (20) gives the residual 
dissipation inequality. 

To complete the proof in the general case, show that we may perturb P(-) so that 
Dr exists for L-a.e. t e [0, dj^] during the perturbed process. Here L is the Lebesgue 
measure on R. We seek the perturbed path in the form P(-)0 where O e Unim is 
sufficiently close to 1. Using Fubini’s theorem on the product space Unim x R with 
the measure v := /x 0 L, where jx is the Haar measure on Unim, one can prove that 
for /x-a.e. O e Unim the differential Dr(P(0O) exists and satisfies (15) for L-a.e. 
t € [0, J;r]- Thus the above part of the proof can be applied to /x-a.e. process P(-)0 
which gives 

r(P^O) — r(P^O) < w^{n). 

Letting O 1 and using the continuity of r proves the residual dissipation inequality 
in the general case. □ 



1.5. A lower bound for plastic work and the extremal energies 

In this section we calculate a lower bound for the plastic work in processes of fixed 
initial and final plastic deformation and determine the maximal and minimal residual 
energies vanishing at a given point. Both these results are stated in terms of the convex 
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hull m** of the dissipation function m, i.e., the largest convex function on Symo not 
exceeding m. 

The following fact will be useful (see [29, Propositions 18.2.4 and 18.2.5]). 

Proposition 4. Let f : U -> R, f/ = Sym, SyniQ be an isotropic function. Then 
f is convex if and only if its restriction /a to diagonal arguments (relative to some 
fixed orthonormal basis) is convex. 

Theorem 1. Let the material satisfy the second law, let D € SyniQ, and let C(D) be 
the set of all processes n = (E(-), P(0) satisfying = Re^F^ for some R e Rot. 
Then 

m**(D) = inf |it;^(7r) : tt e C(D)| . (21) 

Proof Define s : Unim R by 

j(P) = m**(D), P € Unim, 

where we write P = R e Rot, D G Symo prove that ^ is a residual energy 
function. By the construction, s is objective. The convex function m** is locally 
Lipschitz continuous and hence it has the total differential Dm** (A) for a.e. A € SyniQ 
with respect to the Lebesgue measure on SyniQ. If A 6 Symo P ^ Unim are 
related by P = Re^ for some R e Rot, one finds that m** has a differential at A if 
and only if s has a differential at P. Since m is isotropic, nonnegative, and positively 
homogeneous of degree 1 , so also is m** . Using the isotropy, and working in the basis 
of eigenvectors of A (see, e.g., the corresponding considerations in [1] or [29]), one 
derives the formula 

Ds(P)P^ • D = Dm** (A) • D (22) 

for each D e SyniQ. By the convexity and homogeneity of m**, 

Dm**(A) • D < m**(D) < m(D) 

for every A e Symo, A 0 for which the total differential of m** exists. A combi- 
nation with (22) provides (15). This proves that s is Si residual energy function. Let 
us now prove that for every process tt = (E(-), P(0) ^ C(D), 

w‘"{7t) > m**(D). (23) 

Since ;r' := (E(),P()(P*)~')isalsoaprocess,iu^(n-) = ly^(Tr'), and j is a residual 
energy, 

u)^(jr') > j(pf (P‘)-') - i(l) = m**(D), 

which implies (23). This shows that we have the inequality sign < in (21). 

The rest of the proof is devoted to showing the opposite inequality in (21). Thus 
we seek to prove that for every € > 0 there exists a process tt e C(D) such that 

w^{7t) <m**(D) + €. (24) 

Let {e/} be any orthonormal basis and let A = A(e/) be the space of all A e Symo 
represented by diagonal matrices in {e, } so that all elements of A commute. Let m^ 
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and m*^ be the restrictions of m and m** to A . Let us show that = (m a)** where 
the last symbol denotes the convex hull of m a on A. Since m*^ is convex, we have 

^A* — on A. (25) 

Let us extend the function (rriA)** from A to a function g : SyniQ R by isotropy, 
i.e., by 

g(B) = (mA)**(QBQ^), B e Symo, 

where Q = Q(B) e Rot is chosen so as to satisfy QBQ^ 6 A . Such a Q exists by the 
spectral decomposition theorem. Then, first, it is easily seen that g is well defined, 
i.e., independent of the choice of Q, and second, by Proposition 4, g is convex as 
a consequence of the convexity of (mA)**. The construction gives that g < m on 
SyniQ and thus since m** is the maximal convex function not exceeding m, we have 
g < m** on Symo and in particular < m*^ on A. Thus combining with (25) 

we have (mA)** = m*^. Let now D e Symo and let {e/} be any basis in which D is 
diagonal. The above considerations show that 

m**(D) = mr(D) = (mA)**(D) (26) 



and applying the familiar construction of the convex hull [26] to m a, we obtain that 



(mA)**(D) = inf 



3 

^ A,jm(A|) : A/ E A, A./ 
/=! 




3 

= i,5]x,a,=d , 
/=1 



(27) 



which also gives the value of m**(D) by (26). Here the limit 3 in the sums is related 
to the dimension 2 of A(et ) through the Caratheodory theorem. It is noted, and this 
is the main conclusion of the above considerations, that all the elements A/ as in (27) 
mutually commute and commute also with D. Let € > 0 be given. By (27), there 
exist sequences ki > 0, A/ € Symo, ^ 2, 3, such that 

3 3 3 

m**(D) + i>J2^im(Ai), = 1, (28) 

/ = 1 1 = 1 / = 1 

and A/, D mutually commute. We can also assume that A/ 7 ^ 0 for f = 1,2, 3. By 
the definition of m, for each / = 1, 2, 3 there exists a E/ e BE such that 

M(E/)-A/ and m(A/) + ~ > S(E/) • A/, / = 1,2,3. (29) 

The desired process is constructed in the form tt = tti * pi * 7 T 2 * P 2 * tts as follows. 
We take tt/, / = 1 , 2 , 3, as processes of duration A./, where tt/ = (E/, P/(-)) and 

Pi(0 = 

P2(0 = 

P3(f) = eA3/^X2A2gXiAi^ 



t 6 [0, A-i], 

t € [0, A.2], 
t 6 [0, kil 
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We further take p\, P 2 as elastic processes of the form p\ = (Ei(-), Pi = 

(Ei(-), such that Ei = E\, E 2 = Ej, E 2 = %, E 3 = Ej. Then the 

process it = n\ * p\ *tc 2 * P2*'^'i can be constructed and 

pi _ j pf _ gA.iAi+X2A2+X3A3 _ gD 

where we have used the commutativity of the A,- and (28)3. Thus n e C(D). Fur- 
thermore, 

3 3 3 

= Y, = I2^<S(E,) • A; < J]Xim(A,) + ^-< m**(D) -|- e 

i=i (=1 i=l 

by (29)2 and (24). □ 

A residual energy function r is said to be maximal or minimal at H € Unim if 
r(H) = 0 and r > r or r < r, respectively, for any residual energy function r such 
that r (H) = 0. Coleman and Owen [5] show that the set of all free energy functions 
that vanish at a given state is convex and has the largest and the smallest elements. 
By Proposition 2(ii) the problem of describing the extremal free energy functions is 
equivalent to that of describing the extremal residual energy functions. 

Proposition 5. Let the material satisfy the second law and define s,t : Unim -> 

Rby 

^(P) = m**(D), r(P) = 

P G Unim, where we write P = Re^, R e Rot, D € SymQ. Then 5 , t are the 
maximal and minimal residual energies at 1. Moreover they satisfy 

^(OP) < s(0) + 5 (P), r (OP) >t(0) + t(P) (30) 

for any O, P € Unim. 

The extremal residual energies at a general H e Unim are 
5 h(P) = ^(PH), rH(P) = ^(PH), 

P e Unim. 

Proof By the proof of Theorem 1 , 5 is a residual energy; the proof that t is a. residual 
energy is similar. To prove that s is maximal at 1, let r be any residual energy vanishing 
at 1 , let 7t = (E(-), P(*)) t>e any process in C(D), and let n' = (E(-), P(-)(P0~^)- 
The residual dissipation inequality for tt' reads 

r(P^(P^)”^) = r(e^) < {n') = {it). 

Taking the infimum over all processes n e C(D) and using Theorem 1 we obtain 

r{e^) < s(e^). 

Combining with the objectivity of both r, s this gives r < .y. To prove (30), consider 
only s. For any function r on Unim and any H e Unim let rjj denote the shifted 
function given by rn(P) = r(PH). It follows immediately from, e.g.. Condition (hi) 
of Proposition 3 that r is a residual energy function if and only if rn is a residual 
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energy function. Note that for any P e Unim, the function ^p(-) — ^(P) is a residual 
energy vanishing at 1. Hence .yp(-) — ^(P) < s(-) and (30)i follows by inserting O. 
The rest is immediate. □ 



1.6. Il’yushin’s condition: Its first consequences 

Aprocess tt = (V(-), P(-)) is said to be an F-cycle if F(jr^) = F(7 t 0. Every or -cycle 
(see Section 1.3) is an F-cycle. 

IPyushin’s condition. For every F-cycle n, w{n) >0, 

Theorem 2, below, describes the consequences of Il’yushin’s condition on the 
constitutive objects, which include the normality rule. 

Definition 2. The material is said to obey the normality rule if for every E e d£, 

M(E) 6 Ns(E)<5, (31) 

where denotes the normal cone to S at S(E), defined by 

Ns<5 := {D e SyniQ : (Z — S) • D < 0 for every Z G 5| , 

S E SyniQ. 



The following lemma gathers some consequences of the normality rule. 



Lemma 1. If the material obeys the normality rule then 

(i) the stress range S is convex; 

(ii) for every D E SyniQ, 

m(D) = sup {S • D : S E 5} 

and in particular^ 



m(D) = S-D 



for any S e S such that D E Ns<S; 

(iii) m is isotropic, convex, and positively homogeneous of degree 1; 

(iv) the subdifferential of m is given by 



9m (D) = 



{S E SyniQ : D e Ns5} 

5 



if D ^ 0, 

ifB = 0 ; 



(v) the convex conjugate m* ofm is given by 



(32) 

(33) 



(34) 



m*(S) = 



0 ifS E 5, 
00 otherwise. 



(35) 
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Proof, (i): Let us derive (i) from the following assertion, which is easy to prove: 
IfM C is a closed set with nonempty interior such that for each x e dM we 
have NjcM 7!^: {0} , then M is convex. Let us verify that 5 satisfies the hypotheses 
of the assertion. Clearly, S is closed since it is an image of the closed set £ under 
the continuous mapping S. Further, S has nonempty interior since £ has nonempty 
interior by Definition l(i) and (iv) and S maps the interior of £ onto the interior of S 
by Definition l(iii). Finally, for each S e 95 we have Ns5 7*^: {0} since S maps 3£ 
onto dS by Definition l(iii) and for each E G 95 we have (31). (ii): Let D 0 and 
E € 95 be any point such that M(E) ~ D so that D e by the normality rule 

which means that (S — S(E)) • D < 0, i.e., 

S D < S(E) D 

for any S e 5. Fixing S and taking the infimum of the right-hand side over all E g 95 
such that M(E) ~ D we obtain 

S • D < m(D) 

with the equality holding if S = S(E), where E is any element as above. Once (ii) has 
been established, (iii)-(v) follow from the standard duality theory for homogeneous 
degree 1 convex functions [9], [26]. □ 

Theorem 2. The material satisfies ITyushin's condition if and only if the following 
two conditions are satisfied: 

(i) T has an elastic potential p\ 

(ii) i/D G Symo and E, Ee“^ G 5 then 

> p(E) - m(D). (36) 

Moreover, if (\) and (ii) hold then the material obeys the normality rule. 

Item (ii) is called Condition E in the subsequent discussion. 

Proof. Assume that the material satisfies Il’yushin’s condition. Condition (i) follows 
from the reversibility of elastic processes via the path-independence argument as in 
the proof of Proposition 2(i). (ii): Let Eq G 95 be such that 

M(Eo) D. (37) 

Consider a process tt = tti * tto * 7T2, where 7t\ is any elastic process connecting 
(E, 1) with (Eo, 1), 7T2 any elastic process connecting (Eo, e^) with (Ee“®, R^^), 
and JTo a process of duration 1 with 

Eo(0 = Eo, Po(f) = e®', ^e[0,l]. (38) 

Then ttq is really a process by (37) and the process tt\ *7ro*7r2 is an F-cycle. Il’yushin’s 
condition says 

w{7t) = W^{7t) -f W^{7T) > 0, 

where = p(Ee~^) - p(E), w^(7t) = w^(7to) = S(Eo) • D. The last three 

relations yield p{Ee~^) > /7(E) — S(Eo) D. Since Eo is arbitrary subject to condition 
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(37), the definition of m gives (36). This completes the proof of (ii). Assume that (i), 
(ii) hold and prove the normality rule. Let G e d£. The goal is to prove that 

(S - S(G)) • M(G) < 0 (39) 

for every S e S. Assume first that S is in the interior of 5. Use Definition l(iii) to 
find that there exists an interior point E of 5 such that S = S(E). Set D = M(G) and 
note that for all t > 0 sufficiently small we have Ee~^^ e £ since E is in the interior 
of £. The application of (36) and the use of m(tD) < tS(Eo) • D, which follows from 
the definition of m, provide 

p(Ee~'°) > p(E) - tS(G) ■ D. 

Dividing by letting t tend to 0, and using the stress relation, we obtain S(E) • D < 
S(G) • D and thus eventually (39). Since £ is the closure of its interior, the limit gives 
(39) for each S = S(E), where E is a boundary point of £, and since every boundary 
point of S is of the form S = S(E), where E is a boundary point of £ (see Definition 
1), inequality (39) holds for all S e «S. Assume conversely that (i) and (ii) hold and 
prove Il’yushin’s condition. Let n = (E(-), P(0) be an F-cycle. Since (i), (ii) imply 
the normality rule, Lemma l(iii) says that m is convex and thus m = m**. Then by 
Theorem 1, 



w(7t) = w^(7t) + w^(7t) > p(E^) — p(E^) + m(D), (40) 

where D is such that = R^^P' for some R € Rot. Combining with E^P^ = ET^ 
we obtain E^ = E‘^“^R^ and (40) reads u;(7r) > p(E^e~^) — p(E^ -f m(D) > 0, 
where the last inequality is Condition (ii). □ 

Remark 1. Suppose that the material obeys the normality rule and that T has an 
elastic potential p. Then we have the following: 

(i) Condition E holds for all pairs as in that condition with |D| sufficiently 
small and E in the interior of £. 

(ii) If£ is logarithmically convex in the sense that for every pair q/*E, D of tensors 
with D € SyniQ, E, Ee~^ e £ one also has Ee~^^ e £ for every t e [0, 1], 
then E holds. 

The logarithmic convexity of Item (ii) seems to be hard to verify. The following 
section gives other sufficient conditions to guarantee E. 

Proof (i): Since E is an interior point of £, if |D| is small enough, then Ee~^^ e £ 
for all t e [0, 1]. Writing H(/) := E^”^^ one obtains dp(R(t))/dt = — S(H(0) • D. 
Lemma l(ii) gives S(H(t)) • D < m(D); thus dp(Ji(t))/dt > — m(D) and the 
integration over [0, 1] yields (36). (ii): If £ is logarithmically convex, then in the 
notation of the proof of (i), one has H(t) € £ for all t € [0, 1]. The proof is then 
identical with that of (i). □ 
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1.7. The extended energy functions 

This section shows that IFyushin’s condition leads to energy functions that satisfy 
the dissipation inequalities stronger than those arising from the second law. 

Theorem 3. Suppose that the material satisfies IVyushin^s condition. Let F e Lin^ 
and define 

e(F) = inf |/?(F^~^) + m(D) : D e SyniQ, F^“^ € f | , (41) 

/(F) = sup I p(Fe~^) — m(—D) : D e SymQ, Fe~^ 6 f | . (42) 

Then 

(i) —oo < f <e < oo\ 

(ii) for every ¥ e S the infima and suprema in (41) and (42) are attained for D = 0 
and hence 

e(¥) = /(F) = p(F); 

(iii) ifF^£ then (41) and (42) hold with the condition Fe”^ € £ replaced by 

¥e~^ e d£\ 

(iv) for every process it = (E(-), P(-))» 

e(pf(pi)-i) _ e(F(P‘)-^) < (43) 

/(Ff(pf)-i) _ /(F(P^)-') < win). (44) 

The proof will show that all the infima and suprema in the above theorem are 
taken over nonempty sets but it is not apriori clear that they are finite. The function 
e is called the initial extended energy or briefly initial energy and / the final ex- 
tended energy or final energy. For a concrete material, the function e is calculated 
in Section 1.10. Since for F-cycles the left-hand sides of (43) and (44) vanish and 
thus u;(7r) > 0, we see that the existence of a function e : Lin^ R satisfying (43) 
or similarly the existence a function / : Lin"^ -> R satisfying (44) implies that the 
material satisfies Il’yushin’s condition; thus the existence of such functions is equiv- 
alent to IFyushin’s condition. Since Il’yushin’s condition is strictly stronger than the 
cyclic second law, the dissipation inequalities (43) and (44) are strictly stronger than 
the dissipation inequality (9) stemming from the second law. In Remark 2 and the 
subsequent discussion we shall see that there are qualitative differences between the 
inequalities (43) and (44). It is also noted that similar potentials have been introduced 
by Lucchesi and Silhavy [16], but the dissipation inequalities have been proved only 
for a restricted class of processes. Recently, a potential similar to e has been intro- 
duced by Ortiz and Repetto [21] and Carstensen, Hackl and Mielke [4], and Mielke 
[18] to treat plastic materials from the variational point of view. 

Proof (i): If we set ^ = det F and take any element E of £^ (see Definition l(iv)) then 
by the polar decomposition theorem there exists an R e Rot and D e SyniQ such that 
¥e~^ = ER and hence ¥e~^ e £ for some D e Sym^. Thus the suprema and infima 
in (41) and (42) are taken over nonempty sets and hence e(F) < oo, /(F) > — oo. It 
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remains to be proved that /(F) < ^(F). Note first that if Di , 62, D e SyniQ are such 
that for some R € Rot, then we have the triangle inequality 

m(b) < m(Di) + m(D2). (45) 

To establish (45), let e 95, o' = 1, 2, be such that 

M(Ec,) ~ D« (46) 

(see Definition l(ii)). Let^r^ be processes of duration 1 oftheformjTa = (E«, ?«(•))» 
where 

P2(0 = e'®^ Pi(f) = e'0>e°2, re [0,1], 

Let finally p be an elastic process connecting with . Then tt = 7T2 * p * tti is a 
process in which 

w^{ 7 x) = S(Ei) • Di + S(E2) • 62 

and we have = R^®. Thus the lower bound for the plastic work 

(Theorem 1; recall m = m**) gives 

S(Ei) • Di -|“ S(E2) • D2 ^ wi(D). 

Using (33) we obtain (45). Next use (45) to show that if F G Lin"^ and Di , D2 € SyniQ, 
then 

p(Fe~^^) - m(--Di) < + m(D2). 

Indeed, let D e SyniQ be such that R^^ = some R e Rot so that, 

by (45), 

m(-Di) + m(D2) > m(D). 

Condition E says 

p(Fe~^^e~^) > p(Pe~^^) - m(D) 

and hence 

> p(Fe"°>) - m(-Di) - m(J)2). 

Taking the supremum over all Di such that Fe“®i e S and the infimum over all D2 
such that Fe"®2 ^ ^ completes the proof of (i). (ii): This is trivial by (36). (iii): Let 

e(P) = inf I p(Fe~^) + m(D) : D e Sym{,, Fe~® € | . (47) 

This is an infimum over a smaller set than in (41) and therefore, 

e(F) < e(F). (48) 

To prove the opposite inequality, let D e Synio be such that Fe~® € <?. As F ^ 5, 
there exists a t e [0, 1] such that Fe"®*^ 6 d£. Inequality (36) gives 

p(Pe~^) - p(Fe~^'^ > p(Fe~^^) - m(D) + /n(Dr), 
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where we have used the homogeneity of m. Thus 

p(Fe-°) + m(D) > p(Pe~^') + m(D'), 

where D' := Dr e dS. Thus for each D as in the infimum (41) there exists a D' 
as in the infimum (47) with a value of p{¥e~^ ) 4- m(D') that does not exceed 
p(¥e~^) + m(D). This proves the opposite inequality in (48). The assertion about 
/(F) is proved similarly, (iv): If jr = (E(-),P(-)) is a process and if we write 
Pf = for some R € Rot, D e SyniQ then Theorem 1 (recall m = m** > 0) 

gives 

w(n) > p(E^) — p(E^) + m(D). (49) 

We have 

(P‘)-i) < p(Ff(P*)-le-°) + m(D) = p(E^) + m(D) 
by the definition of e and 

e(F(Pr‘) = «(E’) = p(E‘) 

by (ii); hence (49) leads to (43). Equation (44) is proved similarly. □ 

Let us estimate the extended energy functions at large values of |F|. 

Lemma 2. For every U € Sym“^ with det U = 1 we have 

^|lnU| <ln|U| <lnx/3 + |lnU|. (50) 

2V3 

Proof. Denote by | • |oo the maximum norm of Sym, i.e., 

|D|oo :=max{ldi|, 1^21, 

where d\ > <^2 > ^3 are the eigenvalues of D € Sym. Write U = with D e SyniQ 
sincedetU = 1. Wehave<ii+J2+<i3 =0;henced3 <0and|Dl(5o = max { — c/s } . 
If lD|oo = d\ then trivially |D|oo < 2di. If lD|oo = -^3, then -f ^2 - lD|oo = 0, 
i.e., |D|oo — d\-\-d 2 < 2d\. Hence in every case |D|oo < 2d\ and so 

which implies \\D\oo < ln|^^|. Combining this with |D| < V3|D|oo we obtain 
(50) 1. To prove (50)2, suffices to note that 

|^D|2 < 3^2^! < 3^21D1 oo^ 

i.e.,ln|^*^| <lnx/3+|D| 00* Combining this with |D|oo ^ |D| we obtain the desired 
result. □ 

Remark 2. Suppose that the material satisfies IVyushin's condition and assume that 
for each fixed ^ > 0, the set £s, given by (3), is bounded and that € £s\ 
moreover, assume that 



m(D) > co|D|, D E Symo 



(51) 
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for some co > 0. Then there exist positive constants Pf and functions 

Ye^ Ye^ Yf^ Yf • (0, oo) R such that 

Pe(S) + Pe In |F| < e(F) < y,(5) + Pe ln+ |F|, (52) 

Yf(S) - pf ln+ |F| < /(F) < Yf(S) - Pf In |F1 (53) 

for every F 6 Lin"'', where 5 = det F and In'*' is the positive part of In . 

Proof Let us first prove the upper bound (52)2. Since S is bounded, there exists a 
Co > 0 such that 



tn(B) < co|D| (54) 

for each D e SymQ. Let : (0, cx)) -> R be defined by 

Ye (S) = max {p(E) : E e } (55) 

for each S > 0. The maximum exists and is finite since £s is bounded p is continuous. 
Let 

Pe = 2^/3co, Yei&) •■= Ye(S) + (2co/V3)| In S\ (56) 

for each 5 > 0. Since the functions on both sides of (52)2 are objective and isotropic, 
it suffices to prove (52)2 only if F e Sym'^. If F e 5 then the proof is immediate. Let 
F e Sym"^ \ £. Set 5 = det F and 

D := ln(F/3'/2), i.e., Fe"® = 3'/^. (57) 

Then by (41) we have e(F) < p(3'/^l)+m(D) and using successively (57), (55), (54), 
(57), (50)i, and (56), we obtain (52)2. Let us prove (52)2. We have |AB| < mo|A||B| 
for some mo > 0 and all second-order tensors A, B. Let H G d£ so that H is invertible 
and |H| ^ 0. Then |F| = 1HH“^F| < molB[||H“^Fl for each F e Lin'^ and hence 

IH-'FI > C3|F| 

for all H € 95 and F e Lin+, where cs = min *|Hr* ; H € 95| > 0. The 

compactness assumption of £s implies that the minimum exists and is positive. For 
the same reason. 

Ye (3) ■= min (p(H) : H € d£s] (58) 

is finite. We now prove that (52)2 holds with 

Pe = co> 0, Ye(S) = Ye(S) ~PelnV3 + Pe In C 3 . (59) 

We can again assume that F e Sym+. Let D € SyniQ be such that H := Fe“® e d£. 
Then using successively (58), (51), (50) and (59) we obtain 

p(Fe-D) + m(D) > Ye(S) + P‘ In 1F|. 

Equation (47) implies (52)2- Inequality (53) is proved similarly. □ 




1. Work Conditions and Energy Functions 



21 



Remark 2 shows that there are strong differences in the behavior of e, f for 
large values of |F| on the surfaces detF = 8. We see that / is not even bounded 
from below. For this reason, we restrict ourselves to e. The growth of e is sublinear 
on surfaces of constant determinant since the set S of all values of the stress S, as 
opposed to the Piola-Kirchhoff stress, is bounded: The assumption that 

|S| < c < 00 (60) 

leads to the logarithmic growth due to the extra factor F^ in S = F^Dp(F). The 
sublinear growth is at variance with the linear growth of the energy in the infinitesimal 
deformation Hencky plasticity theory (see Temam [31]). There the stress relation reads 
S = D/?(E) with S satisfying (60), which excludes the superlinear growth. 



1.8. Appendix: A strange conjugation 

Next let us show that Condition E has little or nothing to do with the convexity 
properties of /?. To this end, we introduce a transformation on the set of materials that 
does not change the work in appropriately changed processes but changes the signs 
of potentials like the elastic potential, free energy, extended energies, etc. 

Consider a material X = (E, 0, F, T) determined by the constitutive objects 
£y T, M and define : £ -^ Sym, : d£ -> SyniQ by 

“f (E) = -f (E), EeS, “M(E) = -M(E), E e d£. 

It is easily seen that £, •‘T, **M satisfy Conditions (i)-(v) in Definition 1 and thus 
S, *T, *M determine a material that we denote •iM = (••£, •*0, *F, **T). Clearly, 

‘‘E = E, '*F = F, “t = -t. 

To determine the relationship between ® n and n , define, for every jr = (E(-), P(-)) S 
n, apair#7T = (»E(-), “?(•)) by 

«E(0 = E(d„ - 0, »P(0 = P(d„-0, te[0,d„l 

It is easily seen that for every process ;r e n we have e^U and the operation ^(-) 
establishes a one-to-one correspondence between FI and ^11. One finds that 

u;^(^ 7 r) = w^(tv), 

^m(D) = m(— D), D e SyiriQ, 



and 
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M. satisfies the cyclic second law 
p is an elastic potential for M 
i/r is a free energy for M 
r is a residual energy for M O 

M satisfies H’yushin’s condition 
S is convex ^ 

M satisfies the normality rule 
e is an initial energy for M ^ 

/ is a final energy for M O 



satisfies the cyclic second law, 

^^p := — p is an elastic potential for 
:z= —yjr is a free energy for 
V := — r is a residual energy for 
satisfies Il’yushin’s condition, 
is convex, 

satisfies the normality rule, 

:= —e is a final energy for ^A4, 

:= — / is an initial energy for ^M. 



1.9. Materials of type C 

The rest of the paper is devoted to a class of materials for which the elastic potential 
is logarithmically convex on deformation gradients of fixed determinant. They are 
shown to satisfy Il’yushin’s condition and the extended energy e can be calculated 
using the double convex conjugation with respect to the logarithmic deformation. An 
example of a material of type C is in Section 1.10. 

If / is any mapping defined on Lin"^ and 5 > 0, define the mapping fs on 
Symoby 

fs(A) = A € Symo. 

If p : Lin^ -> R is a continuously differentiable objective isotropic function then p$ 
is isotropic and continuously differentiable and 

Dp^ = S^, (61) 

where S is the traceless part of T defined by 

T(F) ;=Dp(F)F^. (62) 

To establish (61), note that in view of the isotropy of both sides of (61) it suffices to 
verify (61) only on diagonal elements (relative to some orthonormal basis), which is 
trivial. 

The function p is said to be (strictly) logarithmically convex if for every 5 > 0, 
the function p$ is (strictly) convex. It must be emphasized that this definition involves 
only symmetric tensors of constant determinant. By (61) the convexity inequality 
reads 



Ps(B) > ps(A) + S^(A) . (B - A) (63) 

for any A, B e Symo with the strict inequality sign if p is strictly logarithmically 
convex and A 7 ^ B. The convexity in the logarithmic deformation has been examined 
by Hill [11], [12] for elastic materials and was shown to be free from undesirable 
consequences (in contrast to the convexity in the principal stretches); our assumption 
is actually weaker because of the determinant restriction. Note also that the logarithmic 
convexity is consistent with the polyconvexity [30]. Let us show that the logarithmic 
convexity implies the Baker-Ericksen inequalities. 
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Lemma 3. Let p be objective and isotropic, T be given by (62), and E = diag(^i , ei, 
^ 3 ) e Sym^ be diagonal so that also T := T(E) = diag(ri , ^ 3 ) is diagonal If p is 

logarithmically convex then we have the Baker-Ericksen inequalities 

ti > tj if ei >ej (64) 

with the strict inequality sign if p is strictly logarithmically convex and ei > ej . 

Proof Write E = where A = diag(< 2 i , ai, af) e SyniQ, and fix the pair /, j. 

Apply the monotonicity (S^(B) — S^(A)) • (B — A) > 0 to B = diag(fci, ^2» ^ 3 ), 
where the triple {b\, Z? 2 , ^ 3 ) is obtained from {a\,a 2 , az) by interchanging the i, 7 - 
components. Then (B) is a diagonal tensor that differs from (A) by interchanging 
the i, 7 -components and one obtains 

(ti - tj)(ei - ej) > 0 (65) 

with the strict inequality sign if ei ^ ej and ps is strictly convex. Thus if ei > ej, 
we have ti > tj . Since the derivative of p is continuous, a limit provides that ti > tj 
also if ei = ej. □ 

Lemma 4. (i) If p is objective, isotropic, of class and strictly logarithmically 
convex and E, M € Sym"^ satisfy 

/i(ERMR^) > ^(EM) 

for each R e Rot then E and M commute, (ii) Similarly, if f : SymQ M is strictly 
convex and continuously differentiable and A, B 6 Symo satisfy 

/(A + RBR^)>/(A + B) 
for each R € Rot then A, B commute. 

Proof (i): If W is any skew tensor, then the differentiation of 
p(E,e^Me~"^) = p(E^^M) > p(EM) 

with respect to W at W = 0 gives ETE“^ . W = 0, where T := T(F) and F := EM. 
Hence ETE“^ is symmetric which means that E^T = TE^ and hence 

ET = TE. ( 66 ) 

Since T(-) is objective isotropic, we have T = T(C), where C = (FF^)^/^. Let us 
show that ( 66 ) implies that 

EC = CE. (67) 

In a suitable orthonormal basis we have C = diag(ci, C 2 , C 3 ), T = diag(^i, ti, tf), 
and (67) reads (no summation) 

(Ci - Cj)Eij =0, 1 < f, 7 < 3, ( 68 ) 

where Eij are the components of E in our basis. If Ci = cj then ( 68 ) holds trivially. 
If Ci ^ Cj then the strict convexity of p^ implies ti 7 ^ tj by the strict version of the 
Baker-Ericksen inequalities (65) (Remark 3), and as ( 66 ) reads (ti — tj)Eij = 0 we 
have Vij = 0 and ( 68 ) holds again. Hence also E and FF^ commute. This leads to 
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EM^ = M^E. That is, E commutes with and hence also with = M. (ii) 

is proved similarly. □ 

Definition 3. An ideal elastic-plastic material is said to be of type C if it satisfies the 
following conditions: 

(i) the material obeys the normality rule {see Definition 2); 

(ii) T has an elastic potential p that admits an objective, isotropic, class exten- 
sion p : Lin"^ -> R that is strictly logarithmically convex and bounded from 
below; 

(iii) for every 8 > 0, S = (see (3) for the definition of£s)\ 

(iv) 0 is in the interior of S. 

Note that (iv) and the boundedness of S imply that 

ci|D| <m(D) <C 2 |D| (69) 

for some positive constants ci , C 2 and all D e Synio- 

Proposition 6. Each material of type C satisfies Ilyushin's condition. 

Proof Verify Condition (ii) of Theorem 2. It suffices to establish (36) for pairs E 
and D as in that condition and satisfying additionally E € Sym"^. Let Q be a point of 
minimum of the function R p(ER^“^R^) = p{ERe~^), R e Rot, so that 

p(Re-^) > p(EQ^-^Q^). (70) 

By Lemma 4(i), E and commute and hence so also do E and QDQ^. Let 

8 = detE and A = ln(E/5^/^). By the commutativity, = 5V3^A-QDQ^ 

and thus p(Ee~^^^ ) = p^(A — QDQ^); the convexity of ps gives 

ps(A - QDQ^) > ps(A) - Ss(A) • (QDQ^). 

Since E e S, this reads 

p(EQe“°Q^) > /7(E) - S(E) ■ (QDQ^) = p(E) - S(Q^EQ) ■ D. 

By Lemma l(ii), S(Q^EQ) • D < m(D) which in conjunction with (70) leads to (36). 

□ 

We now give simplified constructions of the initial energy for materials of type C. 

Theorem 4. Consider a material of type C and let F e Lin"^. Then 

(i) there exists a unique D e SymQ such that E := Ee~^ e £ and 

e(F) = p(E) + m(D); (71) 

(ii) ifE e £ then E = F, D = 0; 

(iii) ifE ^ £ then E € and 

if additionally E e Sym'*', then F, E, D commute. 



( 72 ) 
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Hence, setting 



P = e^ 



we have the elastic-plastic decomposition 

F = EP 



(73) 



with E 6 5, D e and 

e(F) = p(E) + m(D). 

For F e Sym^ the logarithm of (73) and the commutativity give 

InF = InE + D, 

which decomposes In F into its projection In E onto In (5 flSym^) and the complement 
D in the normal direction to S at the corresponding stress. If we interpret the energy 
e as a stored energy of the associated nonlinear Hencky material, the above shows 
that the constuction of e involves the same projections as in the small deformation 
theory but in the space of logarithmic deformations. 

Proof. It is enough to give the proof in the case F € Sym"^. Write F = where 

5 = det F, A € SymQ, and consider an auxilliary minimum problem 

e = min {^^(A - D) + m(D) : D e Symo} . (74) 

The minimum exists since p is bounded from below and m is coercive (see (69)). Let 
D be a point of minimum. The optimality conditions say that 

SsiA-B) e 3m(D). (75) 

Furthermore, we have in particular 

ps(A -D) < ps(A- RDR^) 

for each R e Rot, and hence A, D commute by Lemma 4(ii). We conclude from 
(75) and dm(D) c S (see (34)) that S^(A — D) G <S. From Definition 3(iii) we 
have S(Es) = S(£^) = S and thus S^(M) = S^(A — D) for some M e SyniQ such 
that e £$• The strict monotonicity of S^, which is a consequence of the strict 

convexity of ps , implies that M = A— D and as A, D commute, we obtain e £. 
Hence 



e = p(¥e ®) + m(D) 
with Fe~^ e f , which implies that 



^(F) < i. (76) 

Let us further prove that the opposite inequality holds in (76). Note that the infimum 
in (4 1 ) exists as a minimum in the present case. Let D be the point of minimum in (4 1 ). 
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Let further Q be a point of minimum of the function R i-> p(Ee R e Rot. 

Lemma 4(i) tells us that F, QDQ^ commute and hence 

p(Fe-^) + m(D) > + m(QDQ^) 

= p«(A - QDQ^) + m(QDQ^) 

> e, 

which proves the opposite inequality in (76). Moreover, the argument shows that D 
is a point of minimum in (41) if and only if D is a point of minimum in (74) and that 
such a point commutes with F. 

(i): To prove the uniqueness of the point of minimum in (41), it suffices to prove 
the uniqueness of the point of minimum in (74). Let Dc^, or = 1, 2, be two distinct 
points of minimum in (74) so that, in particular, F, A, commute, 

S^(A-D,,)€am(D,,), (77) 

and 

ps(.A - Di) + m(Di) = pi(A - D 2 ) + /n(D 2 ). (78) 

The inclusions (77) imply 

S^(A-Da).D«=m(D«), (79) 

and the strict convexity provides 

ps(A - Di) > psiA - D 2 ) + S5(A - D 2 ) • (D 2 - Di), 
which in combination with (78) and (79) leads to 

S 5 (A - D 2 ) • Di > m(Di) 

in contradiction with (32). Thus the minimizer is unique, (ii): This follows from the 
uniqueness and Theorem 3(ii). (iii): The inclusion E e follows from the uniqueness 

and Theorem 3(iii). The inclusion (72) follows from the above proof. □ 

Remark 3. we consider the unique minimizer D and the E from Theorem 4 as a 
function o/F e Lin"^, written D = D(F), E = E(F), then the form of (41) implies 
that the functions have the following transformation properties: 

E(QF) = QE(F), P(QF) = P(F), 

E(FQ^) = E(F)Q^, P(FQ^) = QP(F)Q^, 
for every F G Lin"^, Q G Rot. We define Z : Lin*^ Sym by 

Z(F) = to(E(F)), 

where To(E) = T(E) — ^ (trT(E))l is the traceless part of the Kirchhoff stress. Then 
Z is objective and isotropic and for every 8 > ^we have the stress relation 

Zs(A) = Des(A) 
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for a.e. A e SyniQ. More precisely^ is convex and Z< 5 (A) is a subgradient of e^ at 
A for every A e SyniQ. Thus the Hencky material with the stored energy e is such 
that the traceless part of the Kirchhoff stress always belongs to the stress range S. 

Proof Let A be fixed and D be the corresponding minimizer. If B, D € SyniQ then 
the convexity of ps says that 

pKB-D) > p^(A-D) + S5(A-D).(B-A + D-D) (80) 
and the convexity of m that 

m(D) > m(D) + S^(A - D) • (D - D) (81) 

since (A - D) is a subgradient of m at D. A combination of (80), (81) with 

^^(A) = ps(A - D) + m(D) 

provides 

ps(B - D) + m(D) > (A) + S^(A - D) • (B - A). 

Fixing B and taking the infimum of the left-hand side over all D e Sym^, we obtain 
es(B) > es(A) + S^(A - D) • (B - A), 

which shows that es is convex and S^(A - D) is its subgradient at A. It follows that 
es is locally Lipschitz continuous and the subgradient coincides with the derivative 
for a.e. A € SymQ. □ 



1.10. The duality, example 

This section gives a description of the initial energy for materials of type C in terms 
of the convex conjugation in the logarithmic measure of deformation (Proposition 7). 
The dual description is the analogue, for large deformations, of the duality consid- 
erations presented in Temam [31, Chapter 1, Section 3] in the context of the small 
deformation theory of plasticity. 

Proposition 7. Consider a material of type C. Then for every 8 > Owe have 

es = <Ps, 

where is the convex conjugate of cps given by 

[ 00 otherwise, 

where is the convex conjugate of ps on SymQ. 

Proof The proof of Theorem 4 shows that 

e(F) = min [ps(,X - D) + m(D) ; D € Symo} 



( 82 ) 
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and the proof of Remark 3 that es is convex on Sym^ for every 5 > 0. For a fixed 5, 
consider the family {/d : D e Symo} of convex functions on Synig given by 



/D(A) = p^(A-D) + m(D). (83) 

Equation (82) asserts that 

^^(A) = min {/d(A) : D G Symo} ^ A G Symo- (84) 

The application of the general formula for the convex conjugate of a minimum of a 
family of convex functions [9, Chapter I, Equation (4.6)] in the present case gives 

e|(S) = (min {/d : D G Symo})*(S) = sup {/^(S) : D € Symo} . (85) 

S G SyniQ. We have 

/d(S) = (W(- - D) + m(D))*(S), (86) 



where ps(- — D) denotes the shifted function A p^(A - D). In (86), D is a 
parameter and hence m(D) an additive constant during the evaluation of the convex 
conjugate. The rules for the evaluation of the conjugate of a function shifted by an 
additive constant and shifted in the domain space [9, Chapter I, Equations (4.8) and 
(4.9)] yield in the present case 

/D(S) = p|(S) + S.D-m(D), (87) 

where is the conjugate of ps. Inserting (87) into (85), 

e*(S) = pJ(S) + sup {S • D - m(D) : D G Symo} (88) 

and we note that 

m*{S) := sup {S • D — m(D) : D G Symo} 

is the conjugate of the convex dissipation function m. A combination of (35) with 
(88) then gives ^^(S) = ^^(S). Since es is convex, we have es = (^ 5 )** = □ 



Example 

Finally, an elastic-plastic material is described for which the initial energy and the 
Hencky-type response are calculated explicitly. Let 

f = {E := VR G Lin+ : V G Sym+, R G Rot, |L| < x/^] , 

wherer, ^ are given positive numbers and L = In V—jtr(lnV)l. Define T : Lin"'' 
Sym by 

f (E) = a' (8)81 -F jSL 

for each E g Lin"^, where 5 = detE and a : (0, 00 ) -> R is a continuously 
differentiable function. We use the same symbol T for the restriction of T to £. 
Define M : 85 -> Symo 

M(E) = (jS/t)R^LR, E g d£. 
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The objects T, M determine an ideal elastic-plastic material (E, O, F, T) (see 
Definition 1) and one finds that 

S(E) = /SR^LR, E e Sym+, 

5 = {S 6 Symo : |S| < t} . 

The material is of type C. Indeed, other things being obvious, it suffices to verify 
Condition (ii) of Definition 3. One finds that if p : Lin"^ -> R is defined by 

p(E) = a{&) + ij8|L|2 

for each E e Lin^, where E = VR is the polar decomposition of E, then p is an 
elastic potential for T on Lin"^. We identify p with p. One finds that 

Pi(A) = a(3) + i/3|Al^, A € Symo, 

which is a strictly convex function. Thus the material is of type C. The dissipation 
function is given by 

m(D) = m**(D) = r|D|, D e SyniQ, 

and since it is convex, by Proposition 5, the maximal and minimal residual ener- 
gies are 

= r|lnD|, t(Re^) = -r|lnD|, 

R e Rot, D e SymQ. The initial energy is given by 



a(5) + i^|L|2 if|L|<T//3, 

a(<5)-i(r/^) + r|L| if|Ll>r/^, 



(89) 



for each F e Lin"^, where throughout, F = VR is the polar decomposition of F and 
S = detF. The case |L| < r/;S in (89) follows from Theorem 3. Let us consider 
the case |L| > r/^S. By the objectivity and isotropy we may restrict ourselves to 
F = V e Sym+ \ in fact it suffices to consider an F that is diagonal in some 
orthonormal basis. By Theorem 4, it suffices to seek the infimum in (47) only on 
those D that are diagonal in some basis of eigenvectors of F. If Fe~*^ e d£ then 

p(Fe-°) = a(3) + tV2jS. 



Consider an orthonormal basis of eigenvectors of F so that F = diag(ui, V 2 , V3). 
By (47), 



e(F) = a(S) + r^/ 2 ^ + xM, (90) 

where 

M := inf ||D| : D 6 Symo, € 95, D diagonalj . (91) 

For a D = diag(ifi , di, d^) as in (91) the condition Fe"® € 95 gives | Ino F — D| = 
t /5 and we are led to 

M = inf {|d| :deC], 

where C = {d € P, \d - c\ xjP) is the circle with the center c, c,- = ln(ti,75'/^), 
S := v\ V 2 V 3 , in the plane P = e + ^^2 + <^3 = 0} . By | Ino F| > x//3 the 
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origin (0, 0, 0) is not in the interior of C. Thus we are looking for the point of the 
circle C whose distance from the origin is minimal and hence M = | Ino F| — r/<5 for 
each F € Lin"^. A combination with (90) gives (89). Note the linear growth of e in 
1L| outside £. A differentiation of e gives 



Z(F) = 



if |L| < t/P, 
rL/|L| if |L| > x/p. 
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2. Generalized Elastic-Plastic Decomposition 
in Defective Crystals 

G. P. Parry* 



Abstract. I outline ideas which allow one to prove a rigorous type of elastic- 
plastic decomposition between related crystal states. The context of the work is 
a theory of defective crystals where the microstructure is represented by fields of 
lattice vectors, and the construction of the “generalized elastic-plastic decomposi- 
tion” is a geometrical procedure which does not involve any notion of stress. The 
decomposition of the change of state has the form where F^ and F| 

are piecewise elastic deformations and pP is a rearrangement of “unit cells” of a 
related crystal structure. The relevant unit cells do not derive from any supposed 
perfect lattice-like properties (e.g., translational symmetry) of the crystal states 
(no such presumption is appropriate since the crystals, and corresponding unit 
cells, support continuously distributed defects) but are constructed by means of 
the piecewise elastic deformations using a self-similarity property of the crystal 
states. Further, each individual unit cell, with its attendant distribution of lattice 
vector fields and defect densities, is translated to its new position by the rearrange- 
ment pP, so that the corresponding mapping of points of the body represents a 
“change of shape” of the crystal domain. 



2.1. Introduction 

Lee et al. [7] and Lee’s [6] notion of elastic-plastic decomposition has been influential 
in the development of phenomenological theories of plasticity. Their decomposition 
is based on a thought experiment whereby a notional “unstressed state” is used to 
define a linear transformation pP at a particular material point. Thus, quoting from 
Lee et al. [7], “to achieve the unstressed state, the body must be considered to be 
cut up into small elements, and with the removal of the loads on the elements, the 
unstressed state is approached as the element size approaches zero. However, the 
unstressed elements do not fit together to form a continuous body.” Then an elastic 
deformation F (with components equal to the partial derivatives of a displacement 
field) is decomposed as F = F^F^, where F^ and pP “are not in general matrices of 
partial derivatives, though their product . . . is.” So the elastic-plastic decomposition 
is effectively a constitutive assumption, and common theories of plasticity proceed 

* Division of Theoretical Mechanics, School of Mathematical Sciences, University of Nottingham, Not- 
tingham NG7 2RD, UK. 
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by making explicit assumptions on for example that F^ is determined in terms 
of “slip” on a finite set of prescribed planes compatible with some assumed crystal 
symmetry. 

I proceed differently here, basing the analysis on a theory of crystals with mi- 
crostructure proposed by Davini [1], developed in Davini et al. [2, 3], Fonseca et al. 
[4], Glanville et al. [5], Parry [9, 10, 11], Parry et al. [12, 13]. In the theory, a crystal 
state corresponds to the prescription of three smooth linearly independent “lattice” 
vector fields over some region 5 c Convincing analogues of defect measures 
like the Burgers vectors and dislocation density tensor are readily available in the 
context of Davini’s theory. However, it is also clear that there are many more objects, 
with properties like those of the classic Burgers vector and dislocation density tensor, 
whose roles one should appreciate in this theory of continuous distribution of de- 
fects. So the first few sections of the paper are devoted to constructing and examining 
generalizations of the “classic objects.” 

In fact it turns out to be productive to attempt to answer a seemingly simple 
question: Given two crystal states, when are they elastically related to one another? It is 
easy enough to discover necessary conditions which must be satisfied if two states are 
to be elastically related, but it seems to be much more difficult to produce conditions 
sufficient for this purpose. However, according to Olver [ 8 ], Cartan has provided 
the mathematical theory which allows one to give sufficient conditions that crystal 
states are locally elastically related (see Section 2.4 for the definition of possibly 
unfamiliar terms). These sufficient conditions are a subset of the above conditions 
necessary for states to be elastically related, and they may be expressed in terms 
of the dislocation density tensor and a finite number of its (directional) derivatives. 
Furthermore, it will be clear that the conditions which are sufficient for states to 
be locally elastically related are not also sufficient for those states to be elastically 
related. In fact, it is particularly noteworthy that the archetypical two states which are 
locally elastically related but not elastically related (to each other) are related by slip 
in planes determined by the lattice vector fields. In a sense, then, the slip mechanism of 
phenomenological plasticity theories emerges quite naturally in this microstructural 
theory based just on the geometry of relevant crystal states. For that reason, the paper 
is focussed on examining pairs of states which satisfy the conditions necessary for 
them to be elastically related, which Davini and I have called neutrally related states. 

In Section 2.7, I provide a generalized elastic-plastic decomposition of neu- 
trally related states. The form of the decomposition is quite explicit once such states 
are given; two piecewise elastic deformations are involved, and the plastic part of 
the deformation is determined geometrically as a rearrangement of certain periodic 
structures constructed using a self-similarity property of the relevant crystal states. 

2.2. Preliminaries 

Let the structure of the material correspond to the prescription of 3 vectors i\, I 2 , €3 
at each point jc of a region F, and assume that smooth vector fields are thereby defined 
on that region. 
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Define the crystal state E by 

E = {£«(.), = (1) 

Let u : B ^ B* = u(B) be a smooth diffeomorphism. This diffeomorphism leads 
to the crystal state E* = aE defined by 

E* = {<*(.), B*; a = 1,2, 3}, (2) 

called “the crystal state elastically related to E via the diffeomorphism ii,” if the 
vectors £*, a = 1, 2, 3, are determined throughout by the deformation gradient 
Vm and the vectors ia^ a = 1,2,3, via 

K(U{X)) = Vu(x)la{x), JC e 5, a = 1, 2, 3. (3) 

Briefly, E* is elastically related to E if the vector fields in E* are related to the 
vector fields ia in E by 

K^^uia, (4) 

SO that the vector fields “behave as infinitesimal line elements in the deformation ii.” 
Suppose the three vector fields ^3(0 linearly independent at 

each point. Then one can define dual vector fields d^{*), d}{*), d?{*) by 

d\x)-h{x) = 8l, a,b = \,2,3, X eB, (5) 

The dual vector fields in state E* can be shown to satisfy 

(u(x)) = [V«(JC)]-T dyx), a = h2,3,xeB. (6) 

Define 

S°‘’(x) = d°(x) ■ V A dHx), (7) 

n = d\x)-d^(x)Ad^(x). (8) 

Note that n is nonzero throughout B, and assume that n > 0. A calculation gives that 

I— j («(;c)) = / — j (a:), a,b=l,2,3,xeB. (9) 

By virtue of property (9), the quantities 5“*^" are elastic scalar invariants. They are 
quantities that are unchanged by elastic deformation, in the sense of (9), so they may 
be regarded as measures of inelastic changes of state. 



2.3. When are two crystal states elastically related? 

If one is to factorize a change of crystal states into some kind of elastic and inelastic 
(“plastic”) components, it is a sine qua non that one must be able to recognize when 
two crystal states are elastically related to one another. In this section I derive certain 
necessary conditions which must be satisfied if two states, E , E* say, are to be related 
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in that way. Thus, when states E, E* are given as in (1) and (2), I derive certain 
implications of the relation (3). The first of these follows directly from (9): 

range I ) = range ( ) • w = range , (10) 

B* \ n J B \ n J B n 

since = u{B) and (9) holds, respectively. So 

range I ) (11) 

B \ n J 

is an invariant set, unchanged by an elastic deformation of the state E. 

Now it is plain that the essential property of the quantities which leads to 

the invariance of the set (1 1), is that each such quantity is an elastic scalar invariant, 
satisfying the relation (9). Thus if vs(«) is any elastic scalar invariant of the form 

vsW = a = l,2,3,A:6B, (12) 

for some integer k and some appropriate function v with the property 

(«(^)) = vs(J^) (13) 

whenever E* is elastically related to E, it follows that 

range = range . (14) 

B* B 

I note now that, generally, there is an infinite number of elastic scalar invariants. For 
if Vs is any one such invariant (for example, any one component of (S^^/n)), then 
by differentiating (13) and combining the result with (3), one obtains 

[<:-V*vs.](«(Ar)) = [/^-Vvs](A:), fl = l,2,3. (15) 

Thus, if vs is an elastic scalar invariant, so is each of the first-order directional 
derivatives • Vvs, « = 1, 2, 3. 

Similarly, each of the jrth-order directional derivatives 

(4,-V)(Vi-V)...(««.-V)ve, (16) 

where each of a\ .as-\, as may take the values 1 , 2, 3, is an elastic scalar invariant, 
and s is arbitrary. Generally, then, there is an infinite number of necessary conditions 
of type (14) which must be satisfied if states E, E* are to be elastically related. 
However, it will turn out that not all of these conditions are independent, for if each 
of a certain finite number of these conditions holds, then all of the conditions hold 
(see the corollary to Theorem 5). I shall refer to any necessary condition of the form 
(14), where ve is a directional derivative of S^^/n of any finite order, as a necessary 
condition of type I in order to differentiate such conditions from those patterned on 
the invariance of the Burgers integral, which I now discuss and refer to subsequently 
as necessary conditions of type II, and also from some other conditions. 
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Let E, E*, given by (1),(2), be elastically related to one another, so that (3) 
holds and B* = ii(J9). Let f be an arbitrary circuit in B, and let u(f ) be the image 
circuit in B*. Then from (6), 

(f d°* du = (f (Vu)~^ d“ • (Vu)dx = (f d° ■ dx. (17) 

/«(f) /f J( 

So d° • dx, the Burgers integral, is an elastic invariant circuit integral, unchanged 
by any elastic deformation of state E. It is clear that the essential property of the 
field which leads to condition (17), is that the field satisfies (6) when state E 
undergoes elastic deformation to state E*. Evidently, one may construct an infinite 
number of fields which satisfy analogues of (6): Let vs be any nonzero elastic scalar 
invariant. Then 

{v|;(«(x))d‘'*(H(j:))} = [VB(jr)rT{vj;(x)d‘'(j«r)}, a = 1,2,3, (18) 

via (13), whenever (6) holds. So 



^ VY,d^ 



• dx ism elastic invariant circuit integral. 



I shall refer to necessary conditions of the type 

<f vs*d^* • du = (ff vsd^ • dx (20) 

Ma h 

as necessary conditions of type II, noting however that as condition (20) stands, it 
is not so useful as a condition of type I because of the explicit dependence on the 
deformation u (which is unknown, a priori, given states E, E*). 

Next, in an evident generalization, consider elastic invariants corresponding to 
integrals of functions of the lattice vector and their spatial gradients of arbitrary finite 
order over closed surfaces and volumes. (The restriction to closed paths and surfaces 
is discussed in Davini [1] and Truesdell et al. [14].) The first point to note is that it 
suffices to consider volume integrals of the form 

[ fxndv = f fata,^ta,.-.,^'‘la])ndV, VQB, (21) 

Jv Jv 



• integrals over closed surfaces can be converted to volume integrals of this 
type (since the functions under consideration are smooth, by assumption, k 
is arbitrary, and the divergence theorem applies), 

• n = det{rf^} ^ 0, by assumption. 

Note that from (6), 

n*(u(x)) = [det(Va)]“^n(x). (22) 

Hence, for example, for V c with E, E* elastically related 

f n*dV* = f [det(V«)rin[det(V«)]^/V= [ ndV. (23) 
JuiV) JV Jv 
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Similarly, using (13), 

I V'zndV is diii elastic invariant volume integral (24) 

Jv 

whenever vs is an elastic scalar invariant. In particular, there is an infinite number of 
necessary conditions of the type 




v^*n*dV* 



L 



v^ndV, 



(25) 



which I shall refer to as conditions of type III. 

Again, conditions like (25) are not so useful, as they stand, as conditions of type 
I because of the explicit dependence on u. However, it will be possible to convert 
conditions of types II and III to more useful forms later on, in Section 2.6. 



2.4. The basic theorem on locally elastically related states 

It is a natural question to ask if any of the above conditions, which are necessary in 
order that two crystal states be elastically related, are also sufficient for that purpose. 
The answer to this question turns out to be no, and relevant counterexamples will be 
provided later. But the conditions of type I are in themselves sufficient that the two 
crystal states are locally elastically related, where this concept is defined as follows. 

Definition 1. State E is locally elastically related to state It* if, for each xq e B, 
there exists a diffeomorphism Uxq defined on a neighbourhood Nxq ofxo in B, with 
Uxo(Nxo) C B*, such that 

f-l(uxo(.x)) = {Vuxo(x)) laix), a = 1, 2, 3, x e Nxq, xq € B. (26) 

Three more definitions are needed (cf. Olver [8, Chapter 8]). 

Definition 2. Let v be a 9-tuple whose entries are the elastic scalar invariants In, 
a,b,= 1, 2, 3. The 5-th-order classifying set associated with the crystal state E is the 
range (over B) of all rth-order directional derivatives of v with r < s. 

Definition 3. The rank ps of the map which takes a point x to the set of directional 
derivations of v of order less than or equal to s is called the order of the 5 th-order 
structure map. 

Definition 4. Let pn be the maximum value of Ps. Let .yo be the least value ofs such 
that Ps = Pe- Then is called the order of the structure map. Note that 

• Ps is assumed to be well defined, 

• ps is strictly increasing as a function ofs, 

• Ps is well defined because Ps < 3 for all s in this context. 

One of the main results of Cartan’s theory of equivalence, reinterpreted by Olver 
[8, p. 437] and somewhat adapted, is the following theorem. 
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Theorem 5. If crystal states E, E* are such that sq = 5q and the (sq -1- l)th-order 
classifying sets associated with the two states are identical, then E is locally elastically 
related toll* {and vice versa). Moreover, if xq e B* map to the same point 

in the {sq + l)th-order classifying set, one can take uxq{xq) = jCq, for example, in 
the definition of locally elastically related states. 

Corollary 6. Let be the fourth-order classifying set associated with state E, which 

is the range {over B) of all rth-order directional derivatives of{S^^/n) with r < 4. 
The condition 

(27) 

is a finite subset of the necessary conditions of type I. Since sq = Sq <3, it follows 
that (27) is sufficient that E, E* are locally elastically related to each other. 

Note further that if E, E* satisfy (27), so that E, E* are locally elastically 
related, the basic property (13) of scalar quantities vj ensures that all necessary 
conditions of type I are then satisfied. Indeed, if (27) holds, then given xq e B, there 
exists a neighbourhood of jco in 5, and a mapping II jco : Nxq uxq{Nxq) ^ B* 

such that (26) holds. It follows from (13) that 

vs*(mjco(^o)) = ve(xo), (28) 

in particular, for an arbitrary elastic scalar invariant vs . Therefore 

range c range . (29) 

B B* 

Since one also has the reverse inclusion, by an evident argument, it follows that (14) 
holds quite generally. In a sense, this argument shows that the set of up to fourth- 
order directional derivatives of the dislocation density tensor is a functional basis of 
all scalar invariants — though the set is not necessarily “minimal.” 

Remark 1. To show that it is not necessarily true that if the conditions of the theorem 
are met, then E and E* are elastically related to each other, consider the following 
example. 

Let E = [€a, Bc\ a = 1, 2, 3}, E* = Bn\ a = 1, 2, 3}, where e\,e2, es is 
the canonical basis of'R ? , Be is a cube, and Bn is not a cube. Then the conditions of the 
theorem are met because {S^^/n) = 0, {S^^/n)* = 0, and all the relevant directional 
derivatives are zero. One verifies the conclusions of the theorem. IfxQ e B^ Xq e Bn 
are arbitrary, the diffeomorphism Uxq = jc + jCq — xq is such that 

(■*aiUXo(x)) = Ca = (yUxo)ta(x) = HjCqC^o) = •»:$. (30) 

However, there exists no diffeomorphism u with the properties that 

€a = (Vu)efl and u{Bc) = Bn (31) 

because this would imply that (Vii) is the identity, so that Bn would be a translate of 
the cube Be, and so Bn itself would be a cube, contrary to assumption. 
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To conclude this section, I prove a result which will be needed later on, while 
acknowledging that the motivation for the theorem will not be clear at this point (so 
the reader could well skip this at a first reading). 

Theorem?. Given a crystal state E, suppose that jn = that la • 

es = and that = 0. Then E is elastically related to a state E 3 = 

{ifl('), a = 1, 2, 3}, where 

iaix) = X€B, (32) 

for some : x^(B) M?, withx^(B) = x^(B). 



That is to say, if the hypotheses of the theorem are satisfied, then E is elastically 
related to a state E 3 , where the lattice vectors depend only on 

Proof It is given that, for the state E, S^^/n = • V A d^/n = T^^(x^). Hence 

V A — nT^^{x^)ia = \T^^{x^)Sarsd^ A d^ , and taking the divergence, 

iv . {T^^earsd^ A dO = nT“harsT^'' + = 0, (33) 

when the superposed dot denotes djdx^ , So the given dislocation density tensor and 
components iaz must satisfy the compatibility conditions 

t^^£^3 = -T^^SarsV'- (34) 



I show that there is a crystal state E 3 = {ia(*)» B\a = 3} with iaipc) = (jc^), 

~ satisfying (34) and 



~ ~b 
d -W Ad 






(35) 



To do this, it is convenient to recall from Parry et al. [12], say, that 



d“ -V Ad’’ 



==L°-d'’, a,fe=l,2,3, 



(36) 



where, for example, L* is defined as (<2 • V )<3 — (<3 • V)f 2 , the Lie bracket of the 
vector fluids <2(0 and <(•). Therefore, with T“* and <^"3 the prescribed functions of 
x^, it is enough to show that 



L"+ = T‘’’’(x^)t+ 



(37) 



has a solution for <^aa = ^ta(x^)’ ^ = 1,2, 3, a = 1 , 2. These three vector equations 
read (dropping the ''■) 

^23hi - ^33<2i = ] 

(38) 

(■nhi - I 



The equations with i = 3 are identically satisfied, since <^3 = ^a^ and the corre- 
spending equations hold in E. At least one of £ 13 , iiz, ^33 is nonzero; suppose for the 
sake of argument that ^13 ^ 0. Then equations (38)2 and (38)3 imply that (38)i holds. 
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by virtue of the hypothesis = 0. So (38) is equivalent to four linear equations 

for the six unknown quantities / = 1, 2, 3, or = 1, 2, given / = 1, 2, 3 

and a, b, = 1, 2, 3. Choose i\\ = in = 0. Then (38)2 and (38)a become 

^ai’) ^ = 1 ? 2 , | 

eni2i=T^‘’tai, / = l,2,j ^ ^ 

and these equations have a unique solution, given appropriate initial conditions. 

So one can construct a state E 3 of the required form, satisfying (34) and with 
the same dislocation density tensor as E . Recalling the definitions given earlier in this 
section, E , E 3 are rank zero states with the same first-order classifying sets. (The only 
thing that is not immediate is that {ia • V)T'’*^ depends only on in state E, but this 
quantity equals which is a given function of x^.) Hence E and E 3 are locally 

elastically related by Theorem 5. Also by the theorem, the (local) elastic deformations 
leave invariant the planes x^ = constant. I assume that B = for some elastic 
deformation f (even though this restricts the topology, and size, of B). □ 

2.5. Self-similarity of states with constant dislocation density 

It will turn out, later, that generalized elastic-plastic decompositions can be divided 
into two classes, according as to whether or not the dislocation density tensor is 
constant in the two states under consideration. (This statement will become clearer 
in the section dealing with neutral deformations.) So, as a necessary preliminary, 
I discuss geometrical properties of a state E which has all components of 5^^/n 
constant throughout B. In this case, all directional derivations of order greater than or 
equal to one are zero. Theorem 5 may be applied to states E , E* = E , since it is clear 
that 50 = ^0 the associated classifying sets are identical (as E , E* are the same 

state). Therefore, let jco € B,Xq e B(= B*)bearbitrarypointsof J5.Thenjro, Jc^map 
to the same point in the (50 + l)th-order classifying set, since = {S^^/n) 

are constant in B. The theorem gives the existence of a diffeomorphism Uxq defined 
in a neighbourhood Nxq of jcq such that 

(mjcoW) = xeNxo, and Uxo(xq) = x^. (40) 

This is the self-similarity to which I have referred earlier: Lattice vectors are deter- 
mined throughout the state E by knowledge of the lattice vectors in an arbitrary small 
neighbourhood of an (arbitrary) fixed point. In fact, there is a Lie group structure un- 
derlying this self-similarity, and the local elastic deformation Uxq in (40) may be 
related to the composition function in the group, but I do not consider this aspect in 
detail here (see Parry [11]). 

As an example, consider the state E = [ia('), a = 1, 2, 3} with = 

(1, — jc^, 0), d? = and B = [0, 1]^. Then 

S^^/n = (41) 

is constant in B. One calculates, via (40), that 

Uxq=X^ {x'^ - JCo)(-^ 0 * “ ^o)^l + ^0 ■“ ^ 0 , 



(42) 
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has the properties 

d° (uxo(x)) = [Vkjco]”^ x e Nxo and uxo(.xq) = (43) 

whenever Nxq C B is such that Uxq {^Xq) C B. 



2.6. Neutral deformations 



Now I continue the examination of the necessary conditions of types II and III. 
Consider, to begin with, the conditions of type II, corresponding to circuit integrals 
for which 



(f vj:*d^^du = (fvj:d^dx. 

JuiO 4 



(44) 



The question of interest is whether the three types of necessary condition, taken 
together, are sufficient that states E, E* are elastically related. Suppose then, in 
particular, that (44) holds for some choice of the mapping u : B -> B* but that 
it is not known whether E, U* are elastically related. Now introduce state E' by 
elastic deformation of state E* via the mapping : B* -> B, so that E' = 
{€^'(*), B; a = I, 2, 3}, where 

<a («"’ w) = (y)] y^B* = u(B). (4S) 

By the basic invariance property of the integral • dx, it follows that 

(46) 



(f vj:*d^* • du = (i VY.'d^' • dx. 

Ju(0 /c 

So (44) holds for some mapping u : B B* if and only if there exists a state E* 
such that 



^ v^fd^' dx = ^ VY,d^ • dx, 



(47) 



for each circuit f c B. It follows that, if E is not elastically related to E* but (44) 
holds for some mapping u : B B*, then there is a state E' 7 ^: E such that E' is 
elastically related to E* and (47) holds. 

Now, since f is arbitrary, (47) implies 

V A (v'^fd^'^ = V A (vsd") . (48) 

Likewise, the conditions of type III lead to the relations 



= vsn. 



(49) 



Thus equations (48) and (49) must have solutions E' 7 ^ E with E' elastically related 
to E* if E is not elastically related to E* but (44) and (45) hold for some mapping 
u :B-^ B\ 
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Clearly, the constant elastic scalar invariant vs = 1 is an admissible choice of 
Vs in both (48) and (49). So, in particular, (49) implies that for arbitrary elastic scalar 
invariants vs , 

n = n\ vs' = vs. (50) 



Since (50)2 implies that 



range vs' = range vs , (51) 

B B 

it follows that all conditions of type I hold (for states E, E') when (49) holds. More- 
over, since vs is arbitrary in (51), that equation implies that the classifying sets of 
arbitrary order s corresponding to E , E' are identical. Therefore (50) (or (49)) implies 
that E, E' are locally elastically related to one another. 

Also, taking vs = vs' = 1 in (48) one deduces 

V A = V A Vvs' A d^' -h VS'V A = Vvs A -h vs V A d^, (52) 
and since (50)2 implies Vvs' = Vvs, (52)2 holds only if 

Vvs A id^' - d^) = 0. (53) 

It follows that the necessary conditions of types I, II, and III hold for states 
E, E' if and only if 

VAd^'=VAd^ Vvs A (^^^' - d^) = 0 , vs' = vs, n' = n, (54) 
for arbitrary scalars vs . 

The rest of this section is devoted to showing that (54) holds if and only if the 
choice of vs is restricted to the set of directional derivatives of jn of order less 
than or equal to one. So set 

= {(5'’7«), (la • V)(S'’^ln)\ a,b,c = 1, 2, 3}. (55) 

Reiterating, the equations 

VAd^' = VArf^ Vvs A(d^'-rf^) = 0, vs' = vs, n' = n, 

(56) 

will be the focus of the rest of the paper. To emphasize this fact, states E' such that 
(56) holds, given state E, will be called states neutrally related to E. Mappings of E 
to states E' neutrally related to E will be called neutral deformations. 

Next, I show that (56) implies (54). First notice that V A d^ z= V A implies 
that d^ z=z d^ + for some potentials t^, = 1, 2, 3, not all of them constant if 

E' 7 ^ E. Then (56)2 gives Vvs A Vr" = 0 , vs e a = 1, 2, 3, so in particular 
Vs = vs(r^) for a nonconstant potential r", vs € Now either all the S^^/n 
are constant, or there exists one which is nonconstant; call it In the latter case, 
0 = 0(r^) and I shall assume that this relationship is invertible. It follows that 
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^ Furthermore, in particular, (ih • V)(<S‘^^/n) is a function of 0\ 
call it v(^), since first-order directional derivatives are in T. Hence 

j dv 

{la • V)(^fc • • V)v(0) = {la ■ VG)— (57) 

is a function of 0, since la e !F. Likewise, by induction, directional derivatives 
of S^^ln of arbitrary finite order are functions of the single variable 0. 

Recall now the definitions introduced in Section 2.4, and divide the discussion 
into two cases according as to whether or not all the components of S^^fn are constant: 

(i) : Each component ofS^^fn constant. In this case, the order of the 5th-order 
structure map ps is zero for all choices of s, and each ^th-order classifying set consists 
of a single point. Using theorem 5, one sees that (56)3 implies that the first-order 
classifying sets corresponding to states E, E' are identical, so (56) implies that states 
E, E' are locally elastically related to each other. 

(ii) : There is a nonconstant component ofS^^jn, call it 9. It has been shown that 

all directional derivatives of jn depend on the single variable 9 in this case. Let 
p^(9^ denote a “vector” whose components are the elements of the set of directional 
derivatives of 5^^/n of order less than or equal to s. Then V (p*^ (0)) = 0 has 

rank 1, so = 1 for all s and the order of the structure map is zero once again. So 
(56) implies that E, E' are locally elastically related to each other in this case, too. 

These remarks allow a proof of the fact that (56) implies (the integral version 
of) (54). 

Theorems. If (56) holds, then 

(f VY,'d^ • dx =: (£ v^d^ • dx, f VY,'n'dV = f v^ndV 
Ji; Jv Jv 

for arbitrary scalars v^, arbitrary circuits arbitrary volumes V. 

Proof 

• Since (56) holds, E, E' are locally elastically related, and by virtue of 
Theorem 5, the relevant local elastic deformation Uxq has the property 
uxq(xo) = xo- Let Vs be an arbitrary scalar. Then putting u = Uxq,x = jcq, 
E = E' in (13), one obtains 

= vs(aro), xqeB. (58) 

• Suppose each S^^/n is constant. Let jcq, Xq e i? be arbitrary. Then (40) 
holds and Uxq(xq) = Xq. Let ve be an arbitrary scalar and set Ea^ = 
{ta(-).Nxo; a = 1,2,3}, E* = {4(0, «Xo(A^Xo); « = 1, 2, 3), so that 
Eat* is elastically related to E^^. Then since Xq e Uxq(Nxq) and xq 6 Nxqj 
by (13), 

VE (Jf J) = VE^, (jrJ) = VE^ (ato) = VE (xo). (59) 

So all scalars are constant in this case. 
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• Similarly, if 0 is a nonconstant component of /n, one shows that all 
scalars depend on the single variable G (since (40) holds for points xo, Xq 
such that 0(jro) = ^(Xq) and «xo(->^o) = Xq). 

Thus, fy vs'n'dV = fy v^ndV for arbitrary vs because (56)4 and 
(58) hold. Also, with C = 9<S, for arbitrary vj, 

§^{vY,'d°' - vsrf") -dx = v-zid"' - d“) • dx, by (58), 

= fs ^ ~ •’y 

But this last surface integral is zero in the case that each /n is con- 
stant because each scalar vs is also constant in that case. And when 0 is a 
nonconstant component of by the above remarks, 

Vvs A {d“' - d“) = ^ V6> A (d"' - d“) = 0. 
dO 

by (56)2 with the choice = 0 e □ 



2.7. Generalized elastic-plastic decomposition of neutrally related 
crystal states 

First, notice that if (56) holds, it is not necessarily true that states D, E' are elastically 
related. For if for some potentials a = 1,2,3, with 

det(VV^^) = 1, then (56) holds (with V A = V A = 0, each e T zero, and 
n' = n = 1). An elastic deformation of E' via the mapping ^ = (^^) : B i^(B) 
gives the state E* = {Cay f(B); a = 1, 2, 3} with vol ir(B) = vol B. But by an 
argument similar to that given in the remarks after Corollary 6, there is no elastic 
deformation which maps E* to E unless ^{B) is a translation of B, Therefore it is 
worthwhile to try to understand the structure of solutions of (56) for states E' given 
the state E, and a generalized notion of elastic-plastic decomposition will result from 
this endeavour. This generalized elastic-plastic decomposition will be confined to 
pairs of neutrally related states, though, and so it is limited in that respect. 

It would be an interesting project to try to construct a canonical procedure 
whereby arbitrary states, defined over the same domain but not satisfying the con- 
ditions (56), are deformed one into the other; this might require an appreciation of 
which of the quantities that appear in (56) are independent of each other. For exam- 
ple, suppose that two states with different constant values of defined over the 

same domain, are given. Note that a state with given constant S^^/n is defined to 
within elastic deformation, so a canonical state with the prescribed constant dislo- 
cation density tensor may be chosen. Likewise, one may construct a canonical state 
with given constant (S^^/n)' ^ {S^^/n). The desired procedure would deform the 
first canonical state to the second. Even more specifically, take (S^^/n) = a ® b, 
(S^^/n)' = a' (g) b', a, ft, a', ft' e R^. Then one may take as the first canonical state 
E = {^^(*), B; <2 = 1, 2, 3} with d^ = + b^a A x(j){t), where t = a Ax • b and 
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0 (jc) = (e^ — I— and as the second canonical state E', where primed quantities 
replace the analogous objects in state E . 

Now let Ep = {£^(-), B\ a = 1, 2, 3}, p e [0, 1], where = e^+b^PaP A 
x(j){tP), tP = aP AX • bP, and aP = pa + (1 — p)a', + (1 — p)b'. One has 

Eo = E, El = E', and an inelastic deformation between the two canonical states is 
prescribed. (In fact, one should be a little more careful than this.) 

However, the general problem seems to be rather difficult, and so attention 
is confined to solutions of (56) in this paper. Even so, the structure of solutions of 
(56) will turn out to be wide enough to provide an analogue of the slip mechanism of 
phenomenological plasticity theories, and it will have the merit that the decomposition 
of relevant changes of state is a rigorous geometrical procedure. 

As before, it is efficient to divide the discussion into the two “usual” cases: 

(i): All components of jn constant. The aim is to characterize the solutions 
of (56) in (more or less) the following way. 

• Consider states E , E', where the corresponding lattice vector fields solve (56). 

• Apply appropriate elastic deformations to E, E' so as to produce canonical 
states Ec, E'. 

• Cut states Ec, E' into pieces determined by the self-similarity property of 
Section 2.5, and apply (local) elastic deformations to each of these pieces 
so as to produce a collection of copies of one distinguished piece of each of 
E„ E;. 

• It will turn out that the one distinguished piece of Ec can be chosen so as to 
be a copy of the one distinguished piece of E' . 

• Rearrange the pieces that derive from Ec to correspond exactly to the pieces 
that derive from E' . (It will turn out that one has the correct number of pieces 
for this purpose.) 

This shows that the solutions of (56), given state E, may be characterized in 
terms of a sequence of piecewise elastic deformations and rearrangement, and this is 
the decomposition that was advertised in the introduction, in this case. The argument 
will be further split into three subcases, depending on the rank of the matrix (5^^/n). 
Note that rank (S^^/n) < 3 by virtue of the fact that (56)i,3, with vs = {S^^/n), 
implies that V A • (d^ — d^) = 0 has nontrivial solutions for (d^ — d^), given 
E, provided E' 7 ^ E. So I consider the cases rank (S^^/n) = 0, 1, 2: 

(a): Rank (S^^/n) = 0. In this case (56) is equivalent to 

V A = V A rf" = 0, n = n'. (60) 

From (60)i, d^ = d^ = for some potentials ^ = (V^^), ( = (f^) 

with det = det by (60)2- Construct canonical states Ec, E' by applying 
elastic deformations ^ to E, E', respectively (so d^ = (V^)“^Vi/r^ = for 
example). Then 



Ec = {ea. a = 1, 2, 3}, E^ = <(B); a = 1, 2, 3}, (61) 
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where vol ^{B) = vol ^(fi).Notethat^, ^ are determined only to within an arbitrary 
constant. These two constants may be chosen so that ^(5) and ^ (B) have an interior 
point in common; call it 0 . Choose a (small enough) cube surrounding the point 0 , and 
note that the corresponding “piece” of that is so distinguished is identical to the 
corresponding “piece” of E' . Cut states Ec, E' into pieces determined by translation 
of the small enough cube (in the process constructing a lattice whose unit cell is this 
cube). Note that the self-similarity of state Ec, say, determined by solutions of (40) 
with la = e a, is translation in this case; uxq = x c(jro), for some c(*). I assume 
that the two sets ^(B), with vol ^(5) = wo\^(B), may be approximated 

sufficiently well by collections of copies of the small enough unit cell, the same 
number of unit cells for ir{B) as for ^(B). (Of course, the approximation relates to 
boundary effects.) Then, a rearrangement of the finite number of cells that make up 
the approximation to f(B) produces those cells that make up the approximation to 
^{B). In the limit as the size of the small enough unit cell approaches zero, E' is 
reconstructed more and more accurately from E^. 

(b): Rank / n) = 1. Bearing in mind that (5^^ /n) = constant, the equations 
of interest are 



V A d°' = V A d“, {S“^/n)' = (<S"*/n), n' = n. (62) 

It has been remarked already that knowledge of the neutral deformations for states 
elastically related to E determines the neutral deformations of E, so that one is en- 
titled to choose a canonical state which renders the discussion of solutions of (62) 
as transparent as possible. There is, in fact, another property of (56) which allows 
one to simplify the relevant calculations. Given E, E', let Ey, E^ be defined as 
[Yahi-). B\ a = 1,2,3], B, a = 1, 2, 3}, respectively, where (yj") e 

is such that det(y^) 7 ^ 0. Then it is easy to show that (56) holds if and only if, 
when la and are replaced by y^lb and y^l'b^ respectively, the corresponding equa- 
tions also hold. It follows that one can choose (y^) so that (y “^ )^ /n) (y “^ ) J det y 

takes a particularly convenient form (where y “Ms the matrix inverse of y = (yj^ ))• 
In the case in hand, where = a b,a,b easy to show that 

• if a is parallel to b, one can assume that = e\(S>ei, 

• if a is not parallel to b, one can assume that (S^^/n) = e\ ® ^2- 

(Also, one checks a posteriori that this reduction does not affect the nature of the 
generalized elastic-plastic decomposition.) Here, I give the relevant construction just 
for the first of these two cases, since the second case is not much different to the first. 

Note that the state Ec = {<«(•)» B\ a = 1, 2, 3}, with = (1,— x^,0), 
= e^, has (S^^/n) = e\ e\. The rank of the structure map for states 

E , Ec is zero, and the first-order classifying set for Ec is identical to that of E . Hence 
one may deal with Ec instead of E . Similarly if E' is neutrally related to Ec, an elastic 
deformation fi of E' gives a state E* = {<^(*), B*; a = I, 2, 3}, where la(*) are 
the same fields as above (extended in the obvious way), and vol JS* = vol B because 
n*{fi(x)) = n'^(x) = ndx) = I, x e B, implies that det(V/x) = 1. 
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The task is to reconstruct E* from Ec by a division of the states into collections 
of “unit cells” which are congruent to one another after piecewise elastic deformation, 
followed by a rearrangement of those unit cells. Recalling the example of Section 2.5, 
apply the following piecewise elastic deformation to Ec, 

M*(ar) = x + x'^izo - Zk)e\, Zk<x^ < z*+i- (63) 

where the points . . . , z-jt, . . . , Z-i, zo = 0 , zi, . . . , Zjt, . . . are equally spaced on R, 
with separation say. This deformation has the property 

{Vui)la{x) = la(uf -Zk+ ZO), (64) 

and since e [zk, Zk-hl)^ it follows that the deformation constructs a periodic 

set of lattice vector fields, with period 2 “”^ 3 . As n -> oo, the image of B under the 
mapping iij approaches iioo(B), where Uoo = x — x^x^e\, and vol Uoo(B) = vol B, 
Now apply the same piecewise elastic deformation to E* to produce (the ob- 
vious extension of) the same periodic set of lattice vector fields defined over the 
domain Uoo(fi{B)), and note that voliioo(l^(^)) = vol(|t(R)) = vol(^). Finally, 
a rearrangement of the (congruent) unit cells that approximately make up the state 
defined as Moo(^) then gives the unit cells that approximately make up the state de- 
fined on Uoq(p(B)), and the generalized elastic-plastic deformation is accomplished 
as n oo. 

(c): Rank (S^^/n) = 2. The relevant equations are (62), again. From (62)i, 
d^ = d^ = 1, 2, 3, for some potentials r", and then (62)2 gives (V A d^) • 

Vr^ = 0, Z? = 1, 2, 3. Since rank = 2 implies that rank (V A d°) = 2, it 

follows that Vt" = Ab^ for some functions A. ^ A.^, where denotes Va£/^ 

and b^ ,b^ (say) is a choice of two linearly independent vectors of the set b^,b^,b^. 
(Hence the fields b^ ,b^,b^ are integrable in the sense each that b^ is tangent to a 
surface = constant, b = 1, 2, 3.) Clearly, Vr^ A Vr^ = 0, etc., so that each of 
the potentials r^, may be expressed in terms of any one of the potentials r" 
which is such that Vr" 7 ^ 0 locally. Choose an elastic deformation that deforms the 
surface = constant (corresponding to a potential with Vr^ 7 ^= 0) to the plane x^ = 
constant. It follows that one can assume that 

d^' =d^ + f^{x^)e^. (65) 

Then V a • {d^' - d") = V A • e^Tix^) = 0, so 

(V A . d^)(ia • e^) = S^^la3 = 0, ( 66 ) 

since at least one of the functions is to be nonzero. Also, since rank (5"^) = 2, 
( 66 ) implies that ia 3 = p(x)Va, where v = (v^) e R^ is a unit left eigenvector of 
(5^^), and p{x) is arbitrary. One can arrange, through a further elastic deformation 
which preserves the planes x^ = constant, to show that £33 = where n is a 
function of x^ only (see Davini et al. [3]). Hence each ia 3 depends only on x^ and the 
hypotheses of Theorem 7 as satisfied. So one can assume that lattice vectors depend 
only on in the canonical state under consideration. 
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That is, it is sufficient to consider canonical states of the form 

Sc = j<a(-), « = 1.2,3j, (67) 

where ta{x) = for some : K -» a = 1, 2, 3. Let E' be neutrally 

related to Ec so (cf. (65)) 

d“'=a“ + r(;cV, (68) 

and note that ( 62)3 implies 

rta -e^ = 0. (69) 

In this case, there is a slightly simpler procedure than that which was outlined at the 
beginning of this section which produces the required decomposition. Define 

X* = x - J f^{s)ta(s)ds, (70) 

and calculate that (using(69)) 

Vjc* = I - (g) =I + e^(8)r4, det(Vjc*) = l. (71) 



To construct a state E* , apply the elastic deformation jc* to state and calculate that 

= (I -f (g) f%)T = = d^\x). (72) 

Note that jc*^ = jc^, by virtue of (69) and (70), so depends only on by (67), 
( 68 ), and (72). So the deformation x* leaves planes = constant invariant and 
preserves volumes. The lattice vector fields in state E* are identical to (the obvious 
extensions) of the lattice vector fields in E' : 

K = a = 1, 2, 3} , wo\x\B) = vol(J?). (73) 

Cut states E * , E' into pieces determined by a cubic lattice with one set of lattice planes 
perpendicular to e^. It is clear that E' may be approximately reconstructed from E* 
by rearrangement (i.e., translation in planes = constant) of cells in appropriate 
layers of the lattice. Again, in the limit as the size of the unit cells approaches zero, 
E' is reconstructed more and more accurately from state E*. 

(ii): There is a nonconstant component of In; call it 0. First, an elastic 
deformation allows one to assume that 0{x) = x^. Then, using (56), one finds that 

d^' = + f^{x^)e\ f^a3 = 0, V A = S^Ha3 = 0. (74) 

From (74), one proceeds exactly as in case (i)c above to construct the generalized 
elastic-plastic decomposition. 
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Dedicated to Alan Weinstein on the occasion of his 60th birthday. 



Abstract. Motivated by Poisson structures for complex fluids, such as the Poisson 
structure for spin glasses given in Holm and Kupershmidt [1988], we investigate 
a general construction of Poisson brackets with cocycles. Connections with the 
construction of compatible brackets found in the theory of integrable systems are 
also briefly discussed. 



3.1. Introduction 

Purpose of this paper. The goal of this paper is to explore a general construction 
of Poisson brackets with cocycles and to link it to questions of compatibility similar 
to those encountered in the theory of integrable systems. The study is motivated by 
the specific brackets for spin glasses studied in Holm and Kupershmidt [1988]. 

In the body of the paper, we first describe several examples of Poisson brackets 
as reduced versions of, in principle, simpler brackets. Then we will show how this 
technique can be applied to study the compatibility of Poisson brackets. In the last 
section, we give the example of spin glasses and show that the Jacobi identity for the 
bracket given in Holm and Kupershmidt [1988] is a result of a cocycle identity. 

Background. It is a remarkable fact that many important examples of Poisson 
brackets are reduced versions of canonical Poisson brackets. Perhaps the most basic 
case of this may be described as follows. Start with a configuration manifold Q with 
a Lie group G acting on Q in such a way that Q ^ Q/G is a principal bundle (for 
instance, assume that the action is free and proper). Consider the cotangent space T* Q 
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with its canonical symplectic and hence Poisson structure. As with general quotients 
of Poisson manifolds (see, for instance, Marsden and Ratiu [1999] for the relevant 
basic background material), the quotient space (T* Q)/G inherits a Poisson structure 
from T*Q ina, natural way. 

This construction of (T* Q)/G along with its quotient Poisson structure is a very 
important and rather general method for finding Poisson structures in mathematical 
physics, including fluid and plasma systems, and for obtaining brackets that satisfy 
the Jacobi identity by the nature of their construction (whereas verifying the Jacobi 
identity “by hand” could be quite arduous); see, for instance, Marsden and Weinstein 
[1982], Marsden and Weinstein [1983], Marsden, Ratiu, and Weinstein [1984a,b], 
Holm, Marsden, Ratiu, and Weinstein [1985], and Cendra, Holm, Hoyle, and Marsden 
[1998]. 

The detailed mathematical structure of the reduced Poisson brackets on quotients 
(T*Q)IG has itself a long and interesting history going back to the work of Sternberg 
[1977], Weinstein [1978], and Montgomery [1984], Montgomery [1986] on particles 
in Yang-Mills fields and the bundle picture in mechanics. Recent works in this area, 
such as Cendra, Marsden, and Ratiu [2001a] and Marsden and Perlmutter [2000] take 
the view that one should choose a connection on the shape space bundle QjG 
and then realize (T* Q)IGa^ the Whitney sum bundle T* ( 2/ 0 5* » where g is the 

Lie algebra of G, g* is its dual space, and where g* is the coadjoint bundle associated 
to the given G-action on Q and the coadjoint action of G on g*. The reduced Hamilton 
equations on this space are called the Hamilton-Poincare equations and are introduced 
and studied in Cendra, Marsden, Pekarsky, and Ratiu [2003]. 

However, the construction of brackets on quotients of cotangent bundles is not 
the only case in which the idea of reduction is important to unify the study of otherwise 
mysterious Poisson brackets, some of which are well known from an algebraic point 
of view, namely that for spin glasses in Holm and Kupershmidt [1988]. A key feature 
of such brackets is the presence of cocycle terms. 

Goals of this paper. In this paper we shall present a fairly general construction 
of Poisson brackets with cocycles and show a simple reduction context for their 
study. We then show that the brackets of Holm and Kupershmidt [1988] may be 
viewed as examples of this construction. We will also make some suggestive links 
with compatible Poisson structures in the same sense as are found in the theory of 
integrable systems. 

Future directions. One of the key ideas that will be further developed in the future, 
especially in the context of complex fluids and similar examples, is to realize these 
cocycle terms as curvature (magnetic) terms associated with the reduction process 
itself. This point of view was explored in some detail in Marsden, Misiolek, Perlmutter, 
and Ratiu [1998] and Marsden, Misiolek, Ortega, Perlmutter, and Ratiu [2002] in the 
context of reduction by stages. The corresponding theory of Lagrangian reduction 
by stages was developed in Cendra, Marsden, and Ratiu [2001a] and it was used in 
Holm [2002] to study the variational approach to complex fluids. A similar reduction 
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by stages method was used in Patrick [1999], who studied the Landau-Lifschitz 
equations, which occurs in magnetic materials. 

It is our hope that our results will help provide a better understanding of not 
only spin glasses but also other important examples of complex fluids. The reader 
should realize that even the example of spin glasses has some obviously interesting 
additional and fundamental properties that require a deeper understanding, such as the 
following: When the spin glass bracket is “reduced” from a connection description to 
a “curvature” description, the cocycle disappears, as Holm and Kupershmidt [1988] 
point out. This seemingly miraculous property suggests that the overall reduction 
procedure is simply a Lie-Poisson reduction from the cotangent bundle of a group 
to the dual of its Lie algebra and that this one-step reduction may be realized as a 
two-stage reduction (the first by a normal subgroup). The cocycle is introduced at the 
first step of the reduction and then disappears in the second step, as it must since the 
final bracket is Lie-Poisson. 

Another idea that requires further development is the connection of the global 
Poisson point of view presented here with the local, variational multisymplectic field 
theoretic point of view (see, for instance, Gotay, Isenberg, and Marsden [1997], Mars- 
den, Patrick, and Shkoller [1998], and Castrillon Lopez, Ratiu, and Shkoller [2000]). 
In particular, it would be very interesting to explore the issue of cocycles and reduc- 
tion by stages in the multisymplectic context and to link this theory to that of complex 
fluids. This is a relatively ambitious project that we do not address directly in this 
paper. 

Parts of this paper are pedagogical in that we sometimes give two proofs of 
a result, sometimes by a direct coordinate computation; we feel that this may be 
beneficial for readers who want to see the same result from another viewpoint or who 
prefer an emphasis on coordinate calculations. 



Preliminaries and notation. Let (M, {, )) be a Poisson manifold (see, for exam- 
ple, Marsden and Ratiu [1999] for an exposition and further references). It is well 
known that the Poisson structure { , }, regarded as a skew-symmetric bilinear mapping 
on J^{M) X .F(M ), where J-{M) denotes the space of smooth real- valued functions on 
M, may alternatively be given in terms of a contravariant skew-symmetric 2-tensor 
field C € T^(M) (the space of all 2-tensors with indices up) on M. The relation 
between (, } and C is given by the identity 

{f.g] = C{df,dg) (1) 



for any f, g e !F{M), Let us choose local coordinates x' on M, where / = 1, . . . , m, 
and m is the dimension of M. Then the Poisson bracket may be written as follows 



dx^dxJ' 



where are the components of C in the chosen coordinate system and where the 
summation convention is in force. The Jacobi identity for the Poisson structure is, as 
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is well known, equivalent to the coordinate condition 






dx>- 



dx’’ 



+ C*='^ =0 

dx'' 



for al\i, j,k = 1, ... ,m. 



( 2 ) 



3.2. A bracket associated to a 2-form on a symplectic manifold 

Definition 3.2.1. Let (P, co) be a symplectic manifold, and let Y, be a given 2-form 
on P. For any given f,ge !F(P) the phase space E-bracket off and g is defined by 

{f,g)^ = Y{Xf,X,), (3) 

where X / and Xg are the Hamiltonian vector fields associated to f and g, respectively. 

At this point we do not claim anything about the phase space E -bracket being 
a Poisson structure. We shall address this issue in due course. 

Recall that the Lagrange bracket of two vector fields X and 7 on a symplectic 
manifold (P, o)) is defined by the expression [[X, 7]] := o){X, 7) (see Abraham and 
Marsden [1978], p. 196). Thus the phase space E-bracket of / and g may be viewed 
as a generalization of the Lagrange bracket of X/ and Xg to an arbitrary 2-form on P. 
Note also that {/, = {/, g} is the usual Poisson bracket associated to a symplectic 

manifold. 

Let P = T*Q and denote by co the canonical symplectic form on P. Let Eq 
be a 2-form on Q. If ttq : T* Q Q is the canonical projection, E = tt^Eq is 
a 2-form on T*Q and one can then form the corresponding phase space E -bracket 
using (3). 

It is clear that the phase space E -bracket is skew-symmetric and is a derivation 
on each factor. The natural question is therefore to find the conditions on E for { , }s 
to satisfy the Jacobi identity. We shall give below intrinsic as well as coordinate proofs 
of our assertions and shall study this problem in an increasing degree of generality. 



The phase space E -bracket on the cotangent bundle of a Banach space. We 
study the simplest case of a general construction by first considering the case of vector 
spaces. Let V be a Banach space and Eq : V x V -> R be a bilinear skew-symmetric 
map. The lift of this bilinear form to a bilinear form on V x V* by means of the 
projection map Try : V x V* -> V is given by 



S((Ui,/Xi),(U2, /X2)) = ^0 (VuV2). 



Consider the cotangent bundle r*V = V x V*. Note that the canonical pro- 
jection 7Ty : V X V* ^ y is the projection on the first factor. To compute the phase 
space E-bracket, let / e T(y x V*), the space of real-valued functions on V x V*. 

Recall that the functional derivative of f e T{V*) at a point /x e V* is the 
element 8f/8p. e V defined by the identity 



D/(m) • V = 



v.V 

Sfi 



for all fi,v€V*, 



(4) 
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where D/(/x) denotes the Frechet derivative of / : V* R. 

Since the Hamiltonian structure is canonical, X/(u, /x) = (5//<5/i, —8f/8v), 
and so we get 

{/. 8h O'. = S {Xf, Xg) (V, n) = So 

for all u € V and e V*. This phase space E -bracket is actually a Poisson bracket, 
as the next theorem shows. 

Theorem 3.2.2. Let V be a Banach space, Eo : V y. V ^ be a bilinear skew- 
symmetric map, and E be the induced bilinear skew-symmetric form on V x V* 
defined above. For given /, g : V* -> R, the phase space Ti-bracket on V x V* 

= !X€V*, (5) 

satisfies the Jacobi identity. In particular, for f and g independent of the first factor, 
we have a Poisson bracket on V*. 

A more general theorem will be proved in Theorem 3.2.6 below. 

The momentum E -bracket on the dual of a Banach space. Interestingly, one 
can also get a Poisson structure only on V* by a similar construction. 

Definition 3.2.3. Let V be a Banach space and Y, \ V xV "^be a bilinear skew- 
symmetric form. For given /, g : V* R, momentum E -bracket on functions 
on V* is defined by 

l/.s)rW = i:(|.|), M€V*. (6) 

We use the term “momentum E -bracket” since we think of elements of V* as 
momentum variables and the functions under consideration depend on fx e V* and 
not on (u, p) e T*V, as was the case in the preceding theorem. 

Remarkably, the momentum E -bracket is also a Poisson bracket. 

Theorem 3.2.4. The momentum Tt-bracket given in Definition 3.2.3 satisfies the Ja- 
cobi identity and therefore defines a Poisson bracket on V*. 

We now give three proofs of Theorem 3.2.4, starting with a local one in finite 
dimensions. 

Calculus-based proof. Assume that V is finite dimensional. An immediate corollary 
of (2) is that any skew-symmetric constant tensor field C e 7^(V) on a vector 
space defines a Poisson bracket. In particular, the bracket given by (6) is a Poisson 
bracket. □ 

The following second proof also works for infinite dimensional Banach spaces. 
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Direct proof. Of course, as usual, the only thing we need to worry about is the Jacobi 
identity. Denote by : V V* the map given by rj) = E(^, t]), for any 

e V, where { , ) : V* x V R denotes the pairing between the vector space V 
and its dual V*. The following formula, obtained by differentiating (4) relative to /x, 
will be used in the computation below: 

D (ti) • vj = (v, (IX) ■ = DV(m)(v, p), (7) 

for all fjL,v, p e V*. Thus, one has 

= )+D^S(M)(r*(|). ■) (8) 

SO that for any f, g,h : V* ^ R, it follows that 






= dV(m) (i:‘ (I) , (|)) - (|) ■ • 



It readily follows that if one adds the other two terms obtained by circular permu- 
tations and uses the symmetry of the second derivative, the sum will vanish. This 
proves that { , }e is a Poisson bracket on V*. □ 



The third proof is by reduction. It may seem the most complicated at first, but 
in fact, it is the one that gives the most insight into generalizations of the result. 



Proof by reduction. Start with the cotangent bundle T*V = V x V*. Let V act on 
y X V* as follows (cotangent lift of translations together with a momentum shift): 

h ■ (?. /^) = (^? + 'i. p + • 

One checks that this action is Poisson with respect to the canonical cotangent bracket 



{/. «)can (?. M) 



\sf S^l| W Sixj- 



Now we simply Poisson reduce. The quotient space (V x V*) /V is isomorphic to 
V* by the quotient map 



7t : (1,^) (->• /i- 

Since for / : V* -> R we have 



H^*f) ^ 1 / Sf 

2 \S(^ - 1 



H^*f) ^ Sf 
Sfi 5(At-^EH^))’ 



and 
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it follows that 






S(n*f) S(7t*g)\ IS(n*g) S(n*f) 



’ Sfi 
Sf 






^±\ 



Sfji 



) 



Therefore, {/, ^}x: is the reduced bracket and hence is a Poisson bracket on V*. □ 



The E -bracket on T* 2- Let Q be any given manifold and let Eq be a 2-form on 
Q. Let tcq : T^Q QbQ the natural projection. Then E = ijr^Eo is a 2-form on 
7* Q and thus it has a phase space E -bracket associated to it, as explained before. 
For any given 2-form Eo on Q, one has a well-defined map 

Jt^o . t*tQ T*Q 

given in local coordinates by 

Tr^0{q‘,Sq‘, Pi,Spi) := (q‘,Spi - ]^'EQ(q)ji8q^^ . 

The map is defined at an element w 6 T*TQ and paired with an element 

Uq€TqQ, by 

Uq) = {jxT*QOKQ^^ (w), Uq), (9) 

where e r^2 -Here, \TT*Q 7* 2 denotes the tangent bundle projection 

and kq : TT*Q 7*72 is canonical isomorphism. These maps are related to 
some constructions given by Tulczyjew [1977], but for expository clarity, we shall 
recall the details here. 



Intrinsic characterization of the map kq . First of all, choose a local trivializa- 
tion of 2» in which Q is represented as an open set f/ in a Banach space E. With such 
a choice, 77* 2 is represented by {U x E*) x(E x E*), while 7*72 is represented 
by (U X E) X (£■* X £*). In this representation, the map kq will turn out to be 
given by 

{q, p, 8q, 8p) {q, 8q, 8p, p). 

We will show that the map kq is the unique map that intertwines two sets of 
maps. These maps are given as follows: 

1. First set of maps. Consider the following two maps: 

T7tQ:TT^Q->TQ, 
tttq : T^TQ TQ, 
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which are the obvious maps (recall that we write ttr : T*R R for the 
cotangent projection). The first commutation condition that will be used to 
define kq is that 

7tTQ o Kq = Ttcq. 

Thus, the diagram in Figure 1 should commute. 



T ( rQ ) 



r ( TQ ) 




Figure i . Commutative diagram for the cotangent analogue of the 
canonical involution. 

2. Second set of maps. The second set of maps is the following: 
tt*q:TT*Q^T*Q, 

^0 . J*JQ J*Q 

Recall that the tangent bundle projection of 7 i? to R is denoted xr : TR 
— > R. 

We need to explain the map let e Q and let Uq e Tq Q. 

Then 

= { av ^,\ CT ( Ug , Vg )) , 

where 

ver(M^, Vq)= ^ (Vq 4 - tUq) E Ty^T Q 
\t=o 

denotes the vertical lift of Uq along Vq. 

The second conunutation condition is that 

7T^ O Kq = XJ*Q. 

In other words, the diagram in Figure 2 commutes. 

In a natural local trivialization, these four maps are readily checked to be 
given by 

TTtQiq, p, 8q, 8p) = (q, 8q), 

TtTQiq.Sq. pjp) = (qJq). 

'^T*Q{q, p.Sq.^p) = (q.p). 
n^{q,8q,p,8p) = (^,5p). 
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T{TQ) ^ T{TQ) 



T*Q 

Figure 2 . The second commutative diagram for the cotangent ana- 
logue of the canonical involution. 

from which it is easy to see that the commutation conditions are satisfied with the 
coordinate formula for kq. It is clear that this uniquely characterizes the map kq. We 
summarize what we have proved in the following. 

Proposition 3.2.5. For any manifold Q, there is a unique diffeomorphism 

kq : TT*Q -> T^TQ 

such that the above two diagrams commute. 

In summary, jg defined by equation (9), where kq is the map defined in the 
preceding proposition. 

Cotangent bundle E -bracket. Now we are ready to give the generalization of 
Theorem 3.2.2. 

Theorem 3.2.6. Let Q be a manifold and let Eq be a 2-form on Q. Endow T*T Q 
with the canonical cotangent bundle symplectic structure. There is a unique Poisson 
structure onT*Q obtained by declaring tt : T*T Q -> T*Q to be a Poisson map; 

in fact, this bracket on T* Q is given by the cotangent Eo-bracket 

where Fg e !F(T*Q) and where the fiber functional derivative 8f/8p e TgQ is 
defined by 

, 8f\ d 
Spl dt 

forall{q,p')eT;Q. 

We give two proofs. The first is a brute force coordinate proof and the second 
is by reduction. 

Coordinate proof. Let the 2-form Eq be written in components as 

Eo = Tlijdq^ (S>dqf 






60 



Hernan Cendra, Jerrold Marsden, and Tudor S. Ratiu 



where the sum is understood to be over all /, j (no restriction on indices, such as 
i < j, which would introduce factors of 2 ) and where E/y is a function of q alone. 
Then we have 



{/. 5}So 



‘^dpi dpj' 



The only axiom that is not obvious is the Jacobi identity. To check it, we compute 
directly in coordinates as follows: 



a/ a / dg dh\ 

V 8^S 






dh 



dpk dpi d Pi dpj 






df dg d^h 
dpk dpi dptdpj 



Therefore, we get 



{/. {g, h}^o)^o + {^> {^' /IeoJso + {h, {/, glSolSo 

-V V. V dg d^ 

dpk dpidpi dpj dpk dpi dpidpj 

V.. V V.. dh ay 

dpkdpidpidpj dpkdpi dpidpj 

o-V V..^* Jh 

dpk dpi d Pi dpj dpk dpi dpidpj ' 

The sum of the first and the last term is 



St/S,' 



df d^g 



dpk dpidpi dpj 

^ ^ 9/ d^g 



9/» V, V, 9/ 
+ ^kl^ij 



d^g dh 



dh 



dpk dpidpi dpj 

df d^g 



dpi dpidpj dpk 

df d^g 



(^kl^ij + '^ji'^kl) 





' dpk dpidpi dpj 

dh 



dpk dpi d Pi dpj ’ 



where in the first equality we renamed the summation indices and in the second 
we used symmetry of the mixed partial derivatives. This expression vanishes since 
Tii j = — Sy/ . There are two similar pairs whose sum is also 0 by the same argument. 
Thus, Jacobi’s identity holds. 

Now we shall prove that ; ( 7 * 7 { » }) ^ (T*Q^ { » Ieq) is a Poisson 
map. Let ^ : T* Q -> R and denote by (q ^ , 8q ^ , pi , 8pi) the canonical coordinates 
on r*T(2. By definition of pull-back and the map tt^®, we have 



f{q‘,Sq‘,pi,Spi) = / (q‘,Spi - 
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Therefore, by the chain rule, we have the following identities: 

a ^ k 

dq^ dq^ 2 dpj dq^ ^ 

8(;r^o)*/ _ 1 

3(3^)' 2 

3(7r^o)*/ 

dpi 

Bf _ 3/ 

3(5p)i 3pi ’ 

where the right-hand sides are evaluated at the point (q' , Spi — | "LjiSqj). Therefore, 
the canonical Poisson bracket of and (7r^®)*g is given by 

3 / 3 (jT^o)* g 3 / a g 

Bq‘ Bpi B{8qy B(Sp)i 

B{n^oYgB{n^)*f 3 (;r^o)* g 3 (tt^Q)*/ 

dq‘ Bpi d{Sq)‘ B(Sp)i 

__lv. K 

2 dpk Bpi 2 '* 3pt 3p; 

= S,*|^|^ = (7r^o)*{/,g}Eo. 
dpk Spi 

Uniqueness follows since is a surjective submersion. □ 

Proof by reduction. The strategy of the proof is to reduce the canonical Poisson struc- 
ture on T*TQ to 7*2 t>y means of the map Let us first recall some general 
facts about how one does such a procedure. 

To motivate the construction, we note that if 7T : P -> is a submersive Poisson 
map from the Poisson manifold P to the Poisson manifold R, and if /, g e T(P) are 
such that for z e P, df(z), dg(z) vanish on ker 77 t(z), then d[f, g}(z) vanishes on 
ker Tn(z) as well. In fact, the condition on a 1-form a e TfP needed for it to push 
down to a well-defined 1-form at n(z) e R is precisely that a vanish on the kernel 
of r^TT. 

Now suppose that we want R to inherit a Poisson structure from P . The condition 
needed for the Poisson structure to push down from P to R under a submersion 
f : P R is exactly that if /, g e J^{P) are such that for z ^ P, df{z), dg{z) 
vanish on ker Ttt ( z), then d{/, g}(z) vanishes on ker 7 7 t(z). If this condition holds, 
then it is a straightforward procedure to compute the reduced bracket on /?, as in the 
last part of the preceding proof. This technique has been developed in a more general 
context by Marsden and Ratiu [1986] and has been used in interesting ways in the 
study of integrable systems (see, for instance, Pedroni [1995]). 

In our case, consider the foliation of 7* 7 2 whose leaves are the level sets of 
the map To establish the theorem it is enough to show that for all functions 
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f,g : T*T Q — >■ R, the condition df \ ker =0 and dg \ ker T jr = 0 implies 
d{f, g}| ker = 0. Differentiating along a curve Sq‘(t)), we find that 

^ i:ij(q)Sq‘ = + llij(q)8q‘. 

dt ,^o 

It follows that the tangent of the map 7t^ is given by 
Tjt^^(q‘,Sq', Pi, Spi,q, Sq‘, Pi,8pi) 



= - 2^iji<})Sq 



‘,q\Spj - i ^ 



^^^q’"Sq‘ + T>ij{q)8q' 



Thus, a typical tangent vector {q\8q\ pi, Spi ,q,8q\ pi , 5p/) to the foliation can be 
written in the form 

^q‘,Sq‘,pi,Spi,0, Sq‘,pi, ]^'Lijiq)h^ ■ 

From this it follows that, for a given f{q ^ , 8q ' , pt , 8pt), the condition 



df{q^. W , Pi , ^Pi) • (0, 8q\ Pi,- ^ij (q)Sq^)=0 



is equivalent to 



= 0 . ( 11 ) 

dSq' 2 dSpj ^ ^ 

Assume that / and g satisfy (10) and (11). We must show that the canonical Poisson 
bracket on T*TQ given by 



df 9g df dg / dg df dg df 

1 O I 'r\C*_iOP_ 






2^‘^^^^dSpj dSpi 



dq‘ dpi dSq' dSpi \dq' dpi dSq' dSpi J 
also satisfies (10) and (11). One can easily check that if both / and g satisfy (10) and 
(11), then 

[f,g} 2 ( dSpj dSpi j 2 ^ dSpj dSpi 

^ ... 9/ 

Using this expression it follows directly that (10) holds for {/, g}, that is, 

9{/.g) 0 

dpi 

It remains to show that (11) holds for {/, g}, that is, 






df dg \ 



+ ;tS,/(9)t 7— S/i(^) 



9/ 9g 
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Let us write, for short. 



and 



d df dg 

rYilk(Q) = OCi 

1 9 9/9^ 



=ft- 



The ^th components of a and p are 

d 



OCk = 






= E 



dSql^ 

3 V Bg 



( Bf \ 
\dSpi dSpj) 



+ S; 



and 



Bf B^g 

dSql^dSpi dSpj dSpi dSql^dSpj 

Bf Bg 



fik - (s,v )) 

-iv . \ 9/ \ 

‘1 ^SpidSpidSpj)'^ dSpi dSpidSpj dSql^ ) ‘ 



We have, by (11), 



and, similarly. 



9V 



ay 

dSpidSq>^ 2 ""dSpidSpi 



B^g 1 ^ 9y 



dSpidSq^ 2 '^dSpidSpi' 

Substituting these expressions for d^ffdSpi dSq^ and d^g/dSpi dSq^ into the expres- 
sion for ajc gives that ak Pk = ^ 



3.3. Compatible brackets 



Recall (see, for instance, Marsden and Weinstein [ 1 983] or Marsden and Ratiu [ 1 999] ) 
that if g is a Lie algebra, its dual g* is a Poisson manifold relative to the Lie-Poisson 
bracket 



{/, sKf^) = 




“ V 

_8p.' 8pi^ 



)■ 



( 12 ) 



where /, g : g* -> M, € g*, and ( , > : g* x g -> R denotes the canonical pairing 
between the Lie algebra and its dual. We now prove that the Lie-Poisson bracket and 
the momentum S -bracket are compatible exactly when E is a cocycle. 
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Theorem 3.3.1. Let q be a Lie algebra and E : g x g M. be a bilinear skew- 
symmetric map. For /, g : -> R, the Lie-Poisson bracket and the T, -bracket are 

compatible, that is, 

:= [f,g} + [f,8U 

is also a Poisson bracket on g*, if and only if E is a 2-cocycle, that is, E satisfies the 
identity 

E([?,r;],O + 5]([/7,f],?) + E([f,^],r 7 ) = 0 for all f , r/, ? € (13) 

Remarks 

1 . As an important special case of a cocycle E , one can take an arbitrary cobound- 
ary, which is also known as the modified or constant Lie-Poisson bracket', that is, for 
each given v e g* define 

Sy(^r/) = (v, [^r]]), ^,T]eg 

and consider the associated Poisson bracket { , }^^ This Poisson bracket appears in 
connection with several integrable systems and gives rise, via the associated recursion 
operator, to the commuting integrals of motion (see, for instance, Ratiu [1980] for 
details). 

2. On p. 213 (Chapter IV, Section 5.1 of the 1987 translation) of Libermann and 
Marie [1987], there is a note on the sum of the canonical bracket on the dual of a Lie 
algebra plus a cocycle bracket which gives essentially the content of the preceding 
theorem. However, we give very direct proofs and, in addition, we give a more general 
version, not only because we do it in infinite dimensions but also because of the more 
general Theorem 3.3.3, from which Theorem 3.3.1 follows as a special case. 

There are three interesting ways to prove the preceding theorem. 



Coordinate proof. Let (^i , . . . , be a basis of the Lie algebra g and denote by Cfj 
the structure constants; that is, Cfj are defined by 



The Jacobi identity for the Lie bracket is equivalent to 



cfrCj, + qA, + cicr=o 



(14) 



for all i, j,k,s = I, . . . ,n. The Lie-Poisson bracket has the coordinate expression 






V Sg 



d Pi dpj' 

where p = pi^^ and {^^ . . . , ^'*} is the dual basis of {^i , . . . , in g*. 
The momentum E -bracket has the coordinate expression 

opi dpj 




3. Cocycles, Compatibility, and Poisson Brackets for Complex Fluids 65 



where, as before, E = ( 8 )^^ (with a sum over all i, j). Therefore, the coordinate 

expression of the sum bracket is 

Recall that (2) shows how to express the Jacobi identity on a Poisson manifold 
(P, { , }) in terms of the components of the Poisson tensor. For our case, the quantity 
we must show vanishes for the validity of the Jacobi identity, is 

{t^sC^ir + ^ir) + ^*<) 

+ + ^kr) 

= + E,>) C[^ + {ilsCl^ + S*r) 

= IX, (cicr, + + ciqj) + + E*,cr.. 

The term + q^C-j) vanishes by (14). The rest vanishes if and only 

if the cocycle identity (13) holds, since 



E(r?, [C, 0]) = 



for e 0 . 



□ 



Direct proof . To show that the two Poisson brackets are compatible we must establish 
Jacobi’s identity, so we proceed to a direct computation of the term 

{{/, = {{/, g], h] + {{/, g}s, h} + {{/, g], h }2 + {{/, g)£, Ms. 

The sum of the first term and the other two obtained by circular permutations of 
(Lg^h) vanishes because the Lie-Poisson bracket satisfies the Jacobi identity. The 
same is true for the last term because the E -bracket satisfies the Jacobi identity, as 
we have already proved. To compute the two cross terms we shall need the following 
formula which is readily checked: 

^{/. ^}(M) = M, ■) + p, ■) . 

(15) 



Therefore, using ( 8 ) and (15) yields 
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{{f,gh,h}iix) + {{f,g],hU(lx) 

I r ^ . Sh'W ^ / s , ^ Sh\ 

~ V’ ^ sA) ^ Six ) 

= - (adW ^{/. 8)j:) - (^) , ^{/, ^}) 

= DV(/^) (s' (0) , /r) - D^giix) (s» (g) , ad,\/,^ /i) 

s /r V 

\ _5/x’ 5/x J ’ 5/x / 

(adW O. 5:‘ (^)) - D"sW (adW "•!:"' (^)) ■ 



+ 
+ ] 



The sum of the four terms involving second derivatives and the terms obtained by 
circular permutations of (/, g, h) vanishes in view of the symmetry of the second 
derivatives. 

The sum ofthe third term and those obtained by circular permutations of (/, g, h) 
vanishes if and only if the cocycle identity (13) holds. □ 



Reduction proof . This will be discussed after we prove the more general Theo- 
rem 3.3.3. 



Aphne Poisson structures. The following calculations are similar to some of the 
previous coordinate calculations, but reveal that, instead of dealing with the dual of 
a Lie algebra, which is a Poisson manifold on a vector space in which the Poisson 
tensor depends linearly on jx e Q*, one can as well start with a slightly more general 
situation, in which the bracket depends affinely on /x. 

Let (M, { , }) be a Poisson manifold and assume for the moment that M is a 
vector space. The Poisson structure { , } defines a tensor field C e T^{M) such that, 
for any given /, g G (1) is satisfied. Let us choose linear coordinates on 

M, where i = 1, . . . , m and m is the dimension of M. Then the Poisson structure is 
written as 



dx^ BxJ ’ 



where are the components of C in the chosen coordinate system. An immediate 
corollary of (2), which we have already utilized, is that for any skew-symmetric 
constant tensor C e Tq(M) the structure on M defined by (1) defines a Poisson 
bracket. 

Now let us consider a skew-symmetric tensor field C € Tq{M) depending 
linearly on x, that is, 

C = C.JcL (16) 



Then the Jacobi identity (2) becomes 

+ CfC*' + = 0. 



( 17 ) 
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By differentiation with respect to we obtain the equivalent condition 

+ c/'c*' + cf c'-'' = 0, (18) 

which, of course, coincides with (14). Let a 6 M be fixed and consider the con- 
stant Poisson structure C(a) = Cga^. Then consider the structure C + C(a), whose 
components are -f {a) = . The Jacobi identity (2) becomes 

(;c" + + C^/c‘/) = 0, (19) 

which is satisfied because of ( 1 8) . We have therefore proven that C -f C (a ) is a Poisson 
structure that depends affinely on x e M. 



Example. Let M = 9 * , where 9 is a Lie algebra. Then, as we have remarked already 
above, the Lie-Poisson bracket is given by 

a/ 






BiXj’ 



( 20 ) 



where Cfj are the structure constants of the Lie algebra 9 , which depends linearly on 
fji. From the previous results we can conclude that, for any /xo> the expression 

defines a Poisson bracket. 



( 21 ) 



We must remark that any linear Poisson structure on a vector space, in the sense 
of (16), comes from a Lie algebra structure on the dual of the vector space. 

This discussion proves the following theorem. 

Theorem 3.3.2. IfC is a Poisson structure on the vector space M that depends 
linearly on x and if a e M is fixed then C + C(a) is a Poisson structure. 

Compatibility of brackets on T* 2- Now we shall prove the following theorem, 
from which Theorem 3.3.1 can be deduced by reduction, as we will show in a moment. 

Theorem 3.3.3. Let Q be a manifold and Do a 2-form on Q. The bracket 

}o + {,}so 

on T*Q, where { , }o is the canonical Poisson bracket and where { , Ieq is the bracket 
defined in Theorem 3.2.6, satisfies the Jacobi identity if and only if Ho Is closed. 

Proof. By working in a canonical (Darboux) chart, we can assume without loss of 
generality that Q is an open subset of R'*. A point of T^Q is denoted in the usual 
way ns {q, p) = (^^ . . . , ^”, /?i, . . . , pn)- However, in order to use the notation 
introduced in the previous paragraph, we shall denote . . . , /?l, . . . , /?„) = 

(JC^...,JC'*,...,JC2'*). 

The canonical Poisson structure on T* (2 is given by the constant tensor field 
defined by Cq*^ = if j > n, if i > n, and Cq = 0 otherwise. 

Now let Eo be a 2-form on Q. Then we have the bracket [F, defined 
in Theorem 3.2.6, whose associated tensor field, using the notation x^ instead of 
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or Pi for the coordinates of a point of T*Q, is given by = 2T,i-nj-n if 
i > n and j > n; = 0 , otherwise. Define the tensor field . It is 

antisymmetric because Cq and are antisymmetric. Now let us see which condition 
Do must satisfy in order for to be the associated tensor field to a Poisson bracket. 
This can be achieved using the coordinate expression of the Jacobi identity (2). Taking 
into account that satisfies the Jacobi identity and the fact that the tensor field Cq 
is constant, it can be easily shown that the Jacobi identity reduces to 



. dci^ 

/^ir Zj 






= 0 , 



( 22 ) 



from which, using the special form of Cq , one can easily deduce the equivalent 
condition 



Q 

dq^ dqj dq^ 

This last identity simply states that dUo = 0; that is, Eq is a closed 2-form. 



(23) 

□ 



Proof of Theorem 3.3.1 by reduction. \i Q = G is di Lie group and Eq is a left- 
invariant closed 2-form on G, then { , jg a Poisson bracket on T*G if and only if 
Eo is closed, which is equivalent to the statement that the restriction E of Eq to the 
Lie algebra 9 of G is a 2-cocycle. (This is easily seen by writing the standard formula 
for the 3-form dEo acting on three left invariant vector fields as a sum of six terms. 
Three of these vanish by left invariance and the remaining terms are the terms that 
are involved in the cocycle identity.) From this and the previous theorem we obtain 
easily a proof of Theorem 3.3. 1 , by reduction. □ 






G^di 

0 






0 



Al\/SHm\ 



S^dk + t^yAl 
0 



I 



SH/bp 
SH/SGp 
\SH/SA^J 



3.4. Spin glasses 

Holm and Kupershmidt [1988] show that the evolution equations for spin glasses is 
governed by an affine bracket on the dual of a Lie algebra. Formula (2.26a) of that 
article gives 

/ P/ \ / Pk^i + hPi p^i O^di dkA\ 

p _ ^ dkp 0 

Gqt ^kGot 0 

\AV Vldi + Al, 0 bpi+t«^A] u / 

The Hamiltonian matrix in this equation is given by the Lie-Poisson structure on 
the dual of the Lie algebra © ((A^ 09 )® ® 9 *))) augmented by 

a 2-cocycle. We shall show below how this bracket is obtained as a corollary of 
Theorem 3.3.1 and in the process introduce the dynamic variables and explain the 
above notation. 

Consider two Lie algebras a, b and a vector space U. Assume that G is a 
representation space for a and that a acts on b, as a Lie algebra; i.e., there is a Lie 
algebra homomorphism a -> Der(b), where Der(b) denotes the derivations of the 
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Lie algebra Der (b) . Consider £/ 0 b as a direct sum of Lie algebras, U with the trivial 
Lie algebra structure. Then the diagonal action of a on (7 0 b is a Lie algebra action 
of a on £/ 0 b and one can form the semidirect product Lie algebra a (§)(£/ 0 b) whose 
bracket is given by 

= ([fl> ^2l» ’ W2 — ^2 • Wl, • r^2 “■ t2 • + 1^1 J ^2]) » 

for e a, rji € b, and Ui e U. 

Now let us assume that b = c© W, for c a Lie algebra and W a representation 
space of c. In addition, assume that a acts on c as a Lie algebra, that W is also an 
a-module, and that these three actions are compatible in the sense that the following 
identity holds: 

^ . (a • m;) = (^ • O') • u; + a • • u;) 

for ^ 6 a, a € c, and w e W. Then the diagonal action of a on c© W^, given by 
^ • (a, lu) = (f • cr, f • u;), f € a, a € c, and u; € W, is a Lie algebra action of a on 
c© W. Then, taking b = c© W’, in (24) we get 

[(fl, wi,ai, wi), (^ 1 , wi,ai,u;i)] 

= ([|l» I 2 ], • «2 ~ I 2 • Wl, |l • <72 ~ I 2 • Cri 4- [<Tl,Or2], 

• W2 - ^2' W\ + CJi • W2- CT2 • W \) . (25) 

As a particular case, we shall take 

• a = X(W), the Lie algebra of all vector fields on E”, 

• W = where Q is the space of all tensor fields on E” of a given type 

and y is a given vector space, 

• c = A® ( 8 ) 9 , where A^ denotes the space of smooth functions in E" and 9 is 
a given finite dimensional Lie algebra, with bracket 

[(/l 0 ^ 1 ), (/2 0 ? 2 )] = / 1/2 0 [fl, ? 2 ], 
for fi e A^ and € 9 , / = 1 , 2 . 

In addition, we assume that the a-representation on W is given by 

X • (ct> 0 u) = £x<i> 0 Vj 

for X € 3t(E'^), o) G f2, u € V and where £ denotes the Lie derivative. The Lie 
algebra action of a on c is given by 

:^-(/ 0 |)=£x/ 0 t, 

for X e 3C(E"), / g A^, ^ g 9 . The c-representation on W is given by 
(f ® • (o> 0 v) = f(0 ® ^ • V 

for I G 9 , / G A^, V e V, 0 ) e Q. Using the derivation property of the Lie 
derivative, a direct verification shows that the compatibility identity (25), which in 
this case becomes 



X-((f0l)-(co0v)) = (X-(f0l))^(co^v) + if0^)-(X^(co®v)), 
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is satisfied. Therefore, we can form the semidirect product Lie algebra 
X(R") (D [a® © ((A° (8) 0) (D(S2 ® V))] 
whose bracket is thus given by 



[(^ 1 , ^ 1 , /l 0 0 t>i), (X 2 , g 2 , fl 0 0 V 2 )] 

= ([-X'l,X2], £xig2-£x2^1» £xi/2<S)§2 -£x2/i +/i/2<S)[?1,^2]» 

£ xi <^2 0 1^2 ~ £ x 2<^1 0 1^1 + /l <^2 0 fl • t >2 — / 2^^1 0 f 2 * l^l) • ( 26 ) 

This Lie algebra bracket coincides with the one given by formula ( 3 . 3 ) in Holm and 
Kupershmidt [ 1988 ]. 

If V = 9*, = A"“^(R'*), the space of (n — l)-forms on R”, and the repre- 

sentation of 9 on 9* is the coadjoint representation ^ • fjL = — ad| /z, for f € 9 and 
/X € 9*, then the dual of the Lie algebra above is the phase space of spin glasses. Let 
us denote the variables in 

(x{W) (D [a° © ((A® © 0) dXA"-* ® 0 *))])* = A* X A" X (A" ®0*) X (A^ ®0) 

by (P, p, G, A); here, denotes the space of /:-forms on R'*. In coordinates, P/ is 
dual to Xi , p is dual to 1 € A^, G« is dual to 1 0 where is a basis of 9*, A“ is 
dual to ( 9 / J d^x) (g) € A"“^ (S) 9*. 

Consider the skew-symmetric bilinear map given by 

^ ((Xi, gl, /l 0fl, (g) /Xi), (X 2 , g2, /l 0f2, <^2 0 P'2)) 

= j (df\ A (02){fl2, ?l) - W 2 A CDl)(/ii, f2)- (27) 

A direct verification, using the Stokes theorem and the usual exterior differential 
calculus, shows that E satisfies the cocycle identity, which in this case comes down 
to proving that the sum of all cyclic permutations of 

E(([Xi, X 2 I Xi[g2] - X 2 [gll Xi[/2] - X 2 [/l] + /l/2[tl,^2], 

£Xi( 02 <2)fl2- £x 2<^1 + f\0>2 ® • M2 - flO>l ® Hi ’ Ml) . 

(■X^3, gl, h ® I 3 . <i>3 ® M3)) 

is zero. An alternative way to express ( 27 ) is the following: 



S ((-X^l. ^1. /l ® ^1. o)\ ® Ml)> i^l^ gl> f2®Hl, *>2 ® M2)) 

= J {ihco\)f2(tii,Hi) + (9i / i)4(M2, Hi)) d"x, 

which gives 



/O 0 0 

0 0 0 

0 0 0 

\0 0 spi 



0 \ 


1 / ^ 




® \ 


0 




g 




0 


S^dk 




fH^ 




dkCO^ Ha 


0 




W*M^/ 




\i^if)HV 



X,'^(X, g, f IX) = 
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Applying Theorem 3.3. 1 it follows that the bracket given by the operator matrix 

(Pkdi + dkPi pdi Gpdi atAf-Af ,. \ 

0 0 0 

Va^, + a?, 0 5“a,+r“^Af 0 / 

which coincides with the Hamiltonian matrix in formula (2.26a) of Holm and Kuper- 
shmidt [1988], defines a Poisson bracket. 

Conclusions. In this paper we have given a glimpse at a possible connection of 
constructions of Poisson brackets with cocycles and brackets that one finds in the 
theory of complex fluids. We explicitly considered one example, namely that of spin 
glasses, following Holm and Kupershmidt [1988], but the approach hopefully is gen- 
eralizable to other systems, such as those found in Holm [2002]. One of the longer 
term objectives of this endeavor would be to gain a deeper insight into the origins 
of cocycles. One approach to this is an algebraic one developed by Kuperschmidt 
[1985], but another, as mentioned earlier, is to view cocycles as magnetic terms that 
arise by reduction as the curvature of a connection. For example, the Bott 2-cocycle 
can be viewed as arising this way and there is a rather general approach to this the- 
ory on both the Hamiltonian and Lagrangian sides, as shown in Marsden, Misiolek, 
Ortega, Perlmutter, and Ratiu [2002] and Cendra, Marsden, and Ratiu [2001a], re- 
spectively. We hope to pursue this line of thinking in the future and to make links 
with multisymplectic geometry and reduction. 
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4. Defect-Induced Transitions as Mechanisms 
of Plasticity and Failure in Multifield Continua 

O. B. Naimark* 



Abstract. The multifield statistical approach for the description of collective 
properties of mesodefect ensembles is developed to allow the determination of 
the specific nonlinear form of the evolution equation for the macroscopic tensor 
parameter of defect density. Characteristic self-similar solutions of this equation 
are found that describe the transition from damage to damage localization, strain 
localization, change of symmetry properties due to the generation of collective 
modes of mesodefects. This approach is applied to the study of stochastic crack 
dynamics, scaling effects in failure, the resonance excitation of failure (failure 
waves), the structure of plastic waves and instability in shocked condensed mat- 
ter. 



4.1. Introduction 

During the last few decades the interrelation between the structure and properties of 
solids has been one of the key problems in physics and mechanics. An extensive study 
of transformations of structures stimulates a bridging of the gap between general 
approaches of physics and mechanics of solids. This tendency provides a deeper 
insight into general laws of plasticity and failure, the latter being treated as structural 
transitions induced by defects. 

Real solids are complex in structure bound to a hierarchy of different scale lev- 
els. Solids under loading show changes on all structural levels. These changes have 
been qualified as plastic deformation and damage processes, realized by nucleation, 
evolution, and interaction of defects on appropriate structural levels as well as by the 
interaction of defects between levels. Until recently, no unified multifield theory of 
solids has been developed to describe the variety, complexity, and interplay of pro- 
cesses commonly observed on all levels of structure. Thus, to construct an adequate 
model, prime attention should be given to the choice of physical level of microstruc- 
ture and, consequently, to the type of defects. Furthermore, each structural level is 
influenced by the effects of processes occurring at the smaller scales. 



* Institute of Continuous Media Mechanics, Urals Division, Russian Academy of Sciences, Perm 614013, 
Russia. 
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Experimental study of material responses for a large range of loading rates re- 
veals some unresolved puzzles in failure and plasticity and shows the link of solid 
behavior with the evolution of the ensemble of typical mesoscopic defects (dislocation 
substructures, microcracks, microshears). This feature is particularly pronounced for 
dynamic and shock wave loading, when the internal times of the ensemble evolution 
at different structural levels are approaching the characteristic loading times. As a 
consequence, the widely used assumption in phenomenology of plasticity and failure 
concerning the subjective role of structural variables to stress-strain variables (adia- 
batic limit) can not be generally applied. Moreover, in this case another fundamental 
problem arises concerning the defect ensemble behavior when the multifield nature 
of the defect interaction and the specific role of defects are taken into account. These 
problems are related to the role of the defect ensemble that could be considered in 
the local view, in the continuum limit, as the localized change of the symmetry of the 
displacement field and in the global view as the change of the system symmetry un- 
der the generation of new collective modes. These modes could influence the system 
behavior and determine the relaxation ability (plasticity) and failure. The first change 
of symmetry follows physically from the dislocation nature of defects (formally from 
the gauge field theory) and must be reflected in the corresponding structure of internal 
variables (the order parameters). The second symmetry change is the consequence of 
the group properties of the evolution equations for the order parameters with specific 
nonlinearity, which leads to the spatial-temporal localization in defect distribution and 
the generation of the structures responsible for new internal times scales, and finally 
scenarios of failure and plasticity. In this view the multifield theory of mesoscopic de- 
fects was developed and was applied for the explanation of the effects under dynamic 
crack propagation, failure dynamics, and the flow instability in shocked condensed 
matter. 



4.2. Mesodefect properties 

4.2.1. Dislocation substructures 

It is well known that the dislocation density increases as a consequence of plas- 
tic deformation; correspondingly, changes of dislocation substructures are observed 
under the active loading, fatigue, creep, dynamic, and shock wave loading. The struc- 
ture evolves in a regular manner and despite the variety of the deformed materials 
only a few types of dislocation substructures are observed: As was shown in [1] the 
transitions in dislocation substructures belong to a general scenario for polycrys- 
talline materials and monocrystals. The succession in these transitions depends on 
dislocation interactions and temperature. Sometimes, the transition from one type of 
dislocation substructure to another leads to sharp changes in mechanical properties of 
metals and alloys. The main mechanisms of dislocation friction (viscoplastic material 
responses) and deformation hardening are related to the reconstruction of dislocation 
substructures. 
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Each type of dislocation substructure exists in a specific range of dislocation 
density and it is important that these ranges are stable for large classes of materials. 
The reason of such universality is related to the inherent property of dislocation 
ensembles as essentially nonequilibrium systems with self-similarity features, i.e., 
of characteristic nonlinear responses. This universality in the behavior of dislocation 
systems appears in the experiments as a low sensitivity of the evolution of dislocation 
structures to the external stress, but a high sensitivity to structural stresses induced 
by dislocation interaction. 

The increase of the dislocation density is accompanied by a decrease of the 
distance between dislocations and by stresses due to dislocation interaction. The col- 
lective properties in dislocation ensembles play the leading role in these transitions 
and the substructure formation. The driving agency for the reconstruction of disloca- 
tion ensembles is the tendency to reach a relative minimum of total energy through 
the creation of dislocation substructures [2]. The energy of the dislocation substruc- 
ture includes two parts: the dislocation self-energy and the energy of the dislocation 
interaction. The reconstruction of the dislocation substructure leads to the change of 
both parts. As a consequence, the energy of a newly formed dislocation substructure 
is less than the energy of the preceding substructure. 

The main part in the energy of the dislocation substructure belongs to the dis- 
location self-energy [2] 



AC/ = 



pGb^ 

In 




( 1 ) 



where p is the dislocation density in the dislocation substructure, h is the Burgers 
vector, G is the shear elastic modulus, ro is the radius of the dislocation nucleus, and 
L is the screening radius of the elastic field created by the dislocations. The last scale 
plays the important role in the evolution of dislocation substructures: The increase 
of the dislocation density leads to the decrease of the L-scale in the order of the 
substructure sequence. The typical dislocation substructures that are experimentally 
observed are chaotic: tangles, walls of the cells, subboundaries of strip substructures. 
The value of L is close to the grain size of chaotic substructures and is of the order 
of the width of cell walls or of high density dislocation areas in strip substructures. 

Under the transition to misoriented substructures dislocation charges arise, 
which lead to a decrease of the energy: charge signs change their spatial distribu- 
tion and, simultaneously, substructure parameters become finer. 

Such evolution reflects the self-organizing tendency of reconstruction in the 
dislocation ensemble. The existence of a critical dislocation density corresponding 
to the creation of dislocation substructures was discussed in [1]. Typical values of 
critical dislocation density are presented in Table 1 for copper alloys. 

As usual, the appearance of the local fluctuation of dislocation density proceeds 
to the transformation of one substructure into another. The growth of these fluctuations 
leads to the change of the distribution function for the dislocation density caused by 
the second mode generation. These regularities of the evolution of defect substructures 
allow one to consider the latter as independent subsystems during deformation. In this 
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Table i. Types of transition in dislocation substructures [1]. 



Type of transition 




chaotic substructure tangles 


0.2 


tangles oriented cells 


0.2-0.5 


oriented cells disoriented cells 


0.4-0.6 


pileups -> homogeneous net substructure 


0.2-0.3 


disoriented cells -> net substructure strips 


0.5-2.0 


disoriented cells strips 


3.0 


strips -> substructure with continuous disorientation 


3.0-4.0 



case the list of governing parameters, like temperature and stress, can be extended to 
comprise the order parameters related to the dislocation substructures. 

It appears that the significance of collective effects in the dislocation ensemble 
is enhanced; the dislocation structure is more directly affected by the current value 
of the dislocation density than by the current external stress. 

Thus one is led to the assumption that the dislocation density is the independent 
variable on which the qualitative changes in the material structure and the mechanical 
responses depend. In addition, the hardening parameters are linked with the transfor- 
mations in the substructures. 



4.2.2. Microcrack (microshear) ensemble 

Of all structural levels of subdislocation defects, the level of microcracks and mi- 
croshear may be considered as the representative for the developed stage of plastic 
deformation and failure. Other defects (point defects, dislocations, dislocation pile- 
ups) have smaller values of intrinsic elastic fields and energies in comparison with 
microcracks and microshears. Moreover, the nucleation and growth of these defects 
(that are closest to the macroscopic level) are the final acts of the previous rearrange- 
ment of the dislocation substructures. 

The density of these defects reaches 10^^ - lO^'^cm”^, but each mesoscopic 
defect consists of a dislocation ensemble and exhibits the properties of this ensemble. 
Scenarios of the evolution of ensembles of these mesoscopic defects show features 
of nonequilibrium kinetic transitions, and experimental data obtained in a wide range 
of stress intensities and rates of strain confirm the universality of structural evolution 
and its effect on relaxation properties and failure. 

The typical sizes and concentrations for the microcrack ensemble in different 
materials are represented in Table 2 [3]. 

The important features of the quasi-brittle fracture were established for the 
understanding of various stages of failure: damage, damage localization, crack nu- 
cleation and propagation. It was shown that microcracks have the dislocation nature 
and represent the hollow nuclei of the dislocation pileups. The model representa- 
tion of microcrack as dislocation pileup [4] allowed the estimation of the microcrack 
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Table 2 



Material 




/i(cm 1 


X-ray 


Microscopy 


X-ray 


Microscopy 


aluminum 


0.14 


0.2 


IF' 


— 


nickel 


0.08 


0.1 




2 X lOJ^ 


gold, silver 


— 


0.2 


— 


2 X 10" 


copper, zinc 


— 


0.25 


— 


5 X 10“ 


beryllium 


0.12 


— 


5 X lo’’-* 


— 


steel 30CrMCN2A 


— 


0.1 


~~ 


— 


NaCl 


2 


1-3 


10**-10^ 


10^ 


polyethylene 


— 


0.015 


— 


6 X 10*=’ 


polypropylene 


— 


0.02 


— 


7 X lO*'* 


PMMA 


— 


0.02 


— 


4 X 10'^ 



self-energy [3, 5] 




where B — nb is the total Burgers vector; s = BSo is ih^ penny-shape microcrack 
volume; So is the microcrack base; Vq = is the volume of the defect nuclei with 
ro the characteristic size of the defect nuclei and R the characteristic scale of the 
elastic field produced by microcracks. The estimation given in [3, 4] showed that the 
power of the dislocation pileup is close to n ^ 20. 

Two reasons are important for the dislocation representation of microcracks. 
The first one is the determination of the microcrack energy as the energy of the 
dislocation pileup. The second reason is the determination of microscopic parameters 
for the microcracks as the consequence of the symmetry change of displacement field 
due to the microcrack nucleation and growth. Study of the microcrack (microshear) 
size distribution for the different spatial scales revealed the self-similarity of the 
mesodefect pattern, Figure 1 [6]. The statistical self-similarity reflects the invariant 
form of the distribution function for the mesodefects of different structural levels. 
This fact has important consequences for the development of the statistical multifield 
theory of the evolution of the defect ensemble. 



4.3. Order parameters of continuum with defects 

4.3.1. Some results of gauge field theory 

Currently, the gauge theory is used for the analysis of the structural and physical 
properties of materials with defects, being of great help in developing continuum 
models of such media for the following reasons. Nucleation and growth of defects 
change the diffeomorphic structure of displacement fields. Using the formalism of 
gauge field theory (the Yang-Mills formalism [7]), these changes can be introduced 
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Figure i. Distribution of microcracks and microshears in coordi- 
nates with and without dimension: n is the microcrack concentra- 
tion, I is the characteristic size, and «sc» he are scaling parame- 
ters [6]. 



as the localization of the corresponding symmetry group of the distortion tensor and 
considered as additional internal coordinates that are kinematically allowed. 

The structure of the so-called gauge fields must correspond to the type of defects, 
and the basis for the application of gauge theory to our problem is the establishment of 
the internal symmetry group for the medium with defects. Most mechanical models of 
deformed continua are invariant with the respect to the uniform groups of translations, 
T (3), and rotations, 50 (3). This means that the Lagrangian of the system is invariant 
under the transformation 



^ a:' = + 

where x and ^ are the current and initial coordinates, g(§ , 0 and r (^ , /) are the rotation 
and translation operators, respectively. 

The differentiation operator is invariant with the respect to the transformations 
T (3) and 50(3) and is given by the covariant derivative 

D^x = 3^x -f* VfiX -f 

where and are the differential operators related to rotation and translation [7]. 
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For a medium with defects, the homogeneity of transformation S O (3) and T (3) 
is violated, the operators and correspond to the local rotations and translations 
related to the defects; the internal symmetry group corresponds to the so-called semidi- 
rect product SO(3)T (3). The elements and of this group, which are functions 

of coordinates and time, are the gauge fields. 

We can thus construct a Lagrangian of the medium with defects using the min- 
imal expansion and write the Lagrangian of the system with two additional variables 
F;x and [7] 




where 



^fiv 



^/xFy — Sx,F u + FuFy — FyFp 



Oifiv — "i" r jxPv ~~ ^vPfi "i* 

are the so-called intensities of the gauge fields related to the local rotations and 
translations due to the presence of defects; gik are the components of the metric 
tensor. The phenomenological parameters C\ and C 2 in the Lagrangian L are the 
constants of dislocation interaction. We will use expression (3) for the Lagrangian in 
our analysis of the statistical properties of the ensembles of defects. 

4.3.2. Microscopic and macroscopic variables for microcracks (microshears) 
ensemble 

The structural parameters associated with microcracks and microshears were intro- 
duced [3] as the derivative of the dislocation density tensor. These defects are de- 
scribed by symmetric tensors of the form 

Sik=sViVk (4) 

in the case of microcracks and 

Sik = l/2s{vik + liVk) (5) 

for microshears. Here v is unit vector normal to the base of a microcrack or slip plane 
of a microscopic shear ; I is a unit vector in the direction of shear; s is the volume of 
a microcrack or the shear intensity for a microscopic shear. 

The change of the diffeomorphic structure of the displacement field due to 
these defects also has important consequences from the point of view of the symmetry 
change of the system “solid with defects.” This symmetry aspect can be used to model 
arbitrary defects both in crystalline and amorphous materials without the assumption 
concerning the dislocation nature of the defects that originally is the property of 
crystalline materials. 

The average of the “microscopic” tensor Sik gives the macroscopic tensor of 
the microcrack or microshear density 

Pik = n{sik), 



( 6 ) 
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that coincides with the deformation caused by the defects, where n is the defect 
concentration. 



4.4. Statistical model of continuum with mesodefects 

4.4.1. Effective field method 

The effective field is frequently used as an auxiliary field (real or virtual) introduced 
into a theoretical model in order to take into account complex factors (like interparticle 
interactions) that are either too difficult to evaluate rigorously or not even clear in 
detail. 

When we refer to “the effective field method” here, we mean that we use an 
auxiliary external multicomponent field constructed in such a way that, when added 
to the corresponding term in the Hamiltonian of the system, makes it a state of 
equilibrium at any given instant. 

The effective field method was reintroduced [8] into statistical physics by Leon- 
tovich [9]. According to Leontovich, for an arbitrary nonequilibrium state of any 
thermally uniform system that is characterized by definite values of internal param- 
eters, the transition into the equilibrium state with the same values of those internal 
parameters may be performed by introducing an additional force field. By definition, 
the entropy of this nonequilibrium state is equal to the entropy of the equilibrium 
(being that due the presence of the additional force field) state characterized by the 
same values of the considered material parameters. 

The microscopic kinetics for the parameter Sik is determined by the Langevin 
equation 



^ik — P'ik 



(7) 



where Kik = ^ ; E is the energy of the defect and Fik is a random part of the force 
field and satisfies the relations {Fnc(t)) = 0 and (Fik(t')Fik(t)) = Q8(t — F). The 
parameter Q characterizes the mean value of the energy relief of the initial material 
structure (the energy of defect nuclei). 

The statistical model of the microcrack (microshear) ensemble was developed 
in the terms of solutions of the Fokker-Planck equation in [2, 3] 



a a 1 a^ 

dt^ ^ 2^ dsikdsik 



w. 



( 8 ) 



According to the statistical self-similarity hypothesis the distribution function 
of defects can be represented in the form W = exp(— -^), where Z is the nor- 
malization constant. By (8), the statistical properties of the defect ensemble can be 
described in terms of the defect energy E and the dispersion properties of the system 
given by the value of Q. 

In terms of the microscopic and macroscopic variables and according to the 
presentation of these mesodefects as the dislocation substructure, the energy of these 
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defects (the Lagrangian) can be written in the form 

E = Eo- HikSik + ocsfi^, (9) 

where the quadratic term represents the self-energy of defects (2) and the term HikSik 
describes the interaction of the defects with the external stress aik and with the en- 
semble of the defects in the effective field approximation: 

Hik = Oik + >^Pik = Oik + ^n(sik), (10) 



where a, X are material constants. The averaging procedure gives the self-consistency 
equation for the determination of the defect density tensor 



Pik = n 



j SikW{s,v,l)dsik. 



( 11 ) 



For the dimensionless variables 

1 ^ fa 



atk = 



Oik 



the self-consistency equation has the form 

Pik = j hkZ~^ exp ^^(7, A + hk - ^fl^dSik, 

which includes the single dimensionless material parameter 5 = ~. Dimensional 
analysis allows us to estimate 



( 12 ) 



a ~ — , A. ~ G, n ^ R 
Vo 



-3 



Here G is the elastic modulus, Vq ~ mean volume of the defect nuclei, 

and R is the distance between defects. Finally we obtain for 8 the value 8 ~ (~)^ 
that corresponds with the hypothesis concerning the statistical self-similarity of the 
defect distribution on different structural levels. The solution of the self-consistency 
equation (12) was found for the case of the uniaxial tension and simple shear [2] 
(Figure 2). 

The existence of characteristic nonlinear behavior of the defect ensemble in the 
corresponding ranges of <5(<5 > 8 ^ ^ 1 . 3 , 8 c <8 < 8^,8 < 8 c ^ 1) was established, 
where 8c and 8^ are the bifurcation points. It was shown [3, 4] that the above ranges 
of 8 are characteristic for the quasi-brittle (5 < ^ 1)» ductile (^c < ^ < ^*)» and 

nanocrystalline (5 > 8^^ ^ 1.3) responses ofmaterials. Thus the behavior of the defect 
ensemble in the different ranges of 8 is qualitatively different. The replacement of 
the stable material response for the fine grain materials with the metastable response 
for the ductile materials with the intermediate grain size occurs for the value of 
8 = 8 ^ ^ 1 . 3 , when the interaction between the orientation modes of the defects 
has a more pronounced character. Therefore, the metastability has the nature of an 
orientation ordering in the defect ensemble. It will be shown in Section 4.8 that the 
continuous orientation transition due to the growth of the defect density (and, as a 
consequence, the decrease of 5) provides the specific mechanism of the momentum 
transfer that is conventionally known as the plastic flow. As will be discussed in 
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Figure 2. Characteristic responses of materials on defect growth. 




Figure 3. Free energy dependence on stress and defect density for 
S < Sc ^ 1. 



Section 4.6, the range of 5 < 5c ^ 1 provides the specific nonlinear kinetics of defect 
evolution, which leads to the nucleation of failure hotspots. 



4.5. Collective properties of ensembles of defects 

4.5.1. Phenomenology of solids with defects. Free energy 

The statistical description allows us to study the phenomenology of solids with defects 
on the basis of an appropriate choice of the free energy form F. As (11) is none other 
than = 0, the simple phenomenological form of the part of the free energy caused 

by defects (for the uniaxial case p = cr = e = e^z) is given by the six order 
expansion, similar to the well-known Ginzburg-Landau expansion [11] 
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The bifurcation points 5*, Sc play a role similar to that of the characteristic 
temperatures in the Ginzburg-Landau expansion in the phase transition theory. The 
gradient term in (13) describes the nonlocal interaction in the defect ensemble in the 
so-called long wave approximation; A, B, C, D, and x are the phenomenological 
parameters. 

The defect kinetics is determined by the evolution inequality [5] 



SF SFdp ^ 

— = r <0 

St Sp dt 



(14) 



that leads to the kinetic equation for the defect density tensor 



dp 

It 







where F is the kinetic coefficient. 

The kinetic equation (15) and the equation for the total deformation 

£ z= Ca -h p. (16) 



(C is the component of the elastic compliance tensor) represent the system of the 
constitutive and evolution equations for a solid with the considered types of defects. 



4.5.2. Collective properties of defect ensemble 

As follows from the solution of (11), presented in Figure 2, transitions through the 
bifurcation points Sc and S^ lead to a sharp change in the symmetry of the distribution 
function as a result of the appearance of some orientationally pronounced macroscopic 
modes of the tensor pik- The effect of transitions on the evolution of the defect 
ensemble is determined by the type of bifurcation, i.e., by the group properties of the 
kinetic equation for the tensor pik for different domains of 5(5 > 5+, 5c <5 <5*, 
5 < 5c). The qualitative relationships governing the changes in the behavior of the 
system are reflected in Figure 4 in the form of families of heteroclines, which are the 
solutions of the equation 

^(l-l)p-V + c(l-0p=-O„-A(^^)=O. (17) 

In the region 5 > 5* this equation is of the elliptic type with periodic solutions 
with spatial scale A and possesses p anisotropy determined mainly by the applied 
stress. This distribution of p gives rise to weak pulsations of the strain field. As 5 5+, 

the solution of (17) passes the separatrix 52, and the periodic solution transforms into 
a solitary-wave solution. This transition is accompanied by divergence of the inner 
scale A : A ^ — ln(5 — 5*). In this case the solution has the form p{t;) = p{x — Vt), 
The wave amplitude, velocity and the width of the wave front are determined by the 
parameters of nonequilibrium (orientation) transition: 

P = \pa{\ - tanh(f/-i)], I = — . 

2 Pa \ A/ 



( 18 ) 
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1 V 







Figure 4 . Types of heteroclines and the corresponding character- 
istic forms. 

The velocity of the solitary wave is 

y XMPa - Pm) 

2^2 

where (pa — Pm) is the jump in p in the course of an orientational transition. A 
transition through the bifurcation point 8c (separatrix ^ 3 ) is accompanied by the 
appearance of spatio-temporal structures of a qualitatively new type characterized by 
explosive accumulation of defects SiS t ^ tc in the spectrum of spatial scales (“blow- 
up” regime) [12, 13]. In this case the kinetics of p is determined by the difference of 
the power of the terms in the expansion (13). 

Assuming a power law dependence of the nonlocality parameter on p, the kinetic 
equation (15) can be written in the form 

f ==S(/..)p“+^(xo(Pc)/f). (19) 

where co = 5/3. It is shown in [12, 13] that for this type of equations the developed 
stage of the kinetics of p in the limit t tc can be described by a self-similar solution 

jc / r \ 

p(x,t) = cl>(t)f(n. ? = — , 0(O = Oo(l--j , (20) 

where m > 0, Oo > 0 are the parameters related to the nonlinear form of (15); 
Lc and tc are the scaling parameters. The function /(f) is determined by solving 
the corresponding eigenvalue problem. For example, for the case co = + 1, the 

self-similar solution of (19) has the form 

Pi,, t) = [S« - ,,)]-* 01^ sin" + »«))' . (21) 

where 0 is a random value in the interval (0, 1). The scale Lc, the so-called funda- 
mental length [13], has the meaning of a spatial period of the solution ( 20 ), 

27T 



1 
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The self-similar solution (20) describes the blow-up damage kinetics for t tc 
on the set of spatial scales Lh = kLc, k = 1,2,...,^. The blow-up kinetics 
of damage localization allowed us to link the hotspots of failure with the above- 
mentioned self-similar structures. 



4.6. Collective behavior of cracks and defects 

4.6.1. Introductory remarks 

The interaction of the main crack with the ensemble of defects is the subject of 
intensive experimental and theoretical studies that revealed some unresolved puzzles 
in the quasi-brittle failure. The longstanding problem is the limiting velocity of a 
crack. The linear elastic theory predicted that a crack should continuously accelerate 
up to the Rayleigh wave speed V/?; however, the experiments on a number of brittle 
materials [14] showed that speed of the crack will seldom reach even the half of 
this value. A portrait of the dissipation process was suggested in [5, 15] where the 
main role in the qualitative explanation of the mentioned effects was assigned to the 
collective modes of the defect ensemble and to the interaction of these modes with 
the moving crack. Below we will discuss the phenomena investigated intensively 
during the past decade within the nonlinear dynamics of crack propagation. These 
phenomena show the qualitative new features caused by the interaction of the crack 
with the ensemble of defects in the so-called process zone. 

The renewed interest in the dynamic fracture during last decade is due to the 
variety of experimental results that cannot be explained within the classical fracture 
mechanics, within which only two steady-state velocities are predicted for a crack in an 
infinite plane specimen: zero and the Rayleigh speed [16]. The recent experimental 
study revealed the limiting steady-state crack velocity, a dynamical instability to 
microbranching [16, 17], the formation of a nonsmooth fracture surface [18], and the 
sudden variation of fracture energy (dissipative losses) with a crack velocity [19]. 

This renewed interest was the motivation for studying the interaction of defects 
at the crack tip area (process zone) with a moving crack. The still open problem of 
crack evolution is the condition of crack arrest, a condition related to the question of 
whether the crack velocity approaches zero smoothly as the load is decreased from a 
large value to the Griffith point [20]. There is also a problem at the low end of crack 
velocity: how a crack that is initially at rest might achieve its steady state. 

4.6.2. Some results in crack mechanics 

The subject of the rapid propagation of a crack in a brittle material is one whose 
roots go back to the classical works of Griffith and Mott [22, 23]. Since that time a 
great deal of both analytical and experimental work [24, 25, 26] has been expected to 
understand the phenomenon of fracture. To study the crack stability and the interaction 
of the main crack with the defect ensemble in the process zone we will consider first 
briefly the classical results in crack mechanics, where much analytical progress has 
been made in assuming that the medium behaves according to the equations of linear 
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Figure 5. The Griffith (1) and Fraenkel (2) energy form of elastic 
solid with a crack. 



elasticity. The important step to model the brittle failure was made by Griffith [22] 
when the additional characteristics of the crack resistance were introduced in the form 
of the energy of the development of the new surface at the crack tip. According to 
the Griffith theory the energy U of elastic materials with a crack is represented in the 
form (Figure 5, curve 1) 

( 7Up‘\ 

^ = -2g(— + 2 ^“’ 



where y is the surface energy; g is the applied stress; a is the crack length; and G is 
the elastic modulus. 

Irwin [24] developed the Griffith conception and proposed the force version of 
the crack stability introducing the stress intensity factor 

Ki = G^^fna. (23) 



Barenblatt [25] proposed a variant of the force version of the crack stability that 
reflects another view on the role of stress in the crack tip area. The effects of cohesion 
forces were assumed to be relevant at the crack tip over the area 0 < 5 < J. 

These forces have a molecular nature and alter the singularity of the stress field. Two 
hypotheses were assumed in the Barenblatt approach: 

(i) small size of the crack tip zone in comparison with the crack length ( \ « 1); 

(ii) autonomous behavior of crack tip, i.e., the self-similar evolution of the crack 
tip in the steady-state regime. 

This hypothesis conforms with the properties that reveal quasi-brittle behav- 
ior. The self-similarity features of the crack tip evolution are the consequence of the 
small ratio of the applied stress g and the cohesion force i.e., ^ 1. This 

fact reflects the intermediate asymptotic character of quasi-brittle failure theories. A 
material parameter was introduced, the so-called the cohesive modulus, as an inde- 
pendent strength property of materials: Kb = Gb yfd. Despite the similar form of the 
cohesive modulus Kb and the stress intensity factor K\ there is a difference between 
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the Irwin criteria and those based on the cohesive modulus proposed by Barenblatt. 
The cohesive modulus determines the steady-state character of crack propagation but 
not the catastrophic one corresponding to the Griffith-Irwin approach. The qualitative 
difference between the two approaches is evidenced by the remarks by Fraenkel [27] 
in the critical analysis of the Griffith approach. Fraenkel wrote that the physically 
realistic form of the energy U must contain the local minimum Ue(ae) (Figure 5, 
curve 2). The difference in the energy At/ = Uc — Ug determines the work of the 
stress field at the crack tip under transition from the steady-state to the unstable regime 
of crack propagation. This work provides the means to surmount the energy barrier. 
It is natural to assume that the cohesive modulus is the force related to this barrier. 
We will show in the following that the metastable energy form, assumed by Fraenkel, 
is related to the collective behavior of the defect ensemble in the process zone and to 
the interaction of the defects with the main crack. 

4.6.3. Origin of crack instability 

The classic theory of fracture treats cracks as mathematical branch cuts that begin to 
move when an infinitesimal extension of the crack releases more energy than needed 
to create a fracture surface. This idea is successful in some cases in practice but 
conceptually incomplete. The experiments fail to confirm this idealized picture. The 
surface created by the crack is not necessarily smooth and flat. In a series of experi- 
ments on brittle fracture the simultaneous propagation of an ensemble of microcracks, 
instead of a single propagating crack, was observed [15]. The fracture process was 
then viewed as a coalescence of defects situated in the crack path; the mean accel- 
eration drops, the crack velocity develops oscillations, and a structure is formed on 
the fracture surface [17, 18, 19, 20]. As the branches grow in size, they evolve into 
macroscopic large-scale crack branches. 

A theoretical explanation of the limited steady-state crack velocity and the tran- 
sition to the branching regime was proposed in [28, 29, 30] within a study of the 
collective behavior of the microcrack ensemble in the process zone. It was shown 
by the solution of the evolution equation for the defect density tensor that the kinet- 
ics of microcrack accumulation at the final damage stage includes the generation of 
spatial-temporal structures (dissipative structures with blow-up damage kinetics) that 
are precursors of the nucleation of the “daughter” cracks. The kinetics of the daughter 
crack generation is determined by two parameters that are given by the self-similar 
solution (20), 

JC / f \ 

p{x, t) = ^ = ***0 ( ^ - 7 j ’ 

where Lc and tc are the scaling parameters that can be found through the solution 
of the corresponding nonlinear eigenfunction problem [12, 13, 29]. These parame- 
ters are the spatial scales Lc of the blow-up damage localization and the so-called 
“peak time” which is the time of damage localization in the self-similar blow-up 
regime. The velocity limit Vc of the transition from the steady state to the irregular 
crack propagation is given by the ratio: Vc ^ 4^. As shown in Section 4.5, the set 

*c 
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of spatial scales Lh (daughter crack sizes) is proportional to Lc and represents a 
new set of independent coordinates (collective modes of the defect ensemble) of the 
nonlinear system for a > cxc. These coordinates characterize the property of a sec- 
ond attractor that could determine the behavior of solutions of the nonlinear system. 
The first attractor corresponds to the well-known self-similar solution (23) for the 
stress distribution at the crack tip which is the background for the stress intensity 
factor conception. This solution is available in the presence of the metastability (local 
minimum) for p in the range a < Gc- 

The steady-state crack propagation occurs when the stress rise in the process 
zone provides the failure time tf>tc — ^iox the creation of the daughter crack 
only in the straight crack path. The failure time tf follows from the kinetic equation 
(15) and represents the sum of the induction time U (the time for the approach of 
the defect distribution to the self-similar profile on the L// = kLc scales) and the 
peak time tf = ti tc. For velocities V < Vc the induction time is ^ tc 
and the daughter crack appears only along the initial main crack orientation. For the 
crack velocity V ^ Vc there is a transient regime (ti ^ tc) of the creation of a 
number of localization scales (daughter cracks) in the main crack path. The crack 
velocity growth in the area V > Vc leads to the sharp decrease of the induction 
time ti ^ 0, tf -> tc that is accompanied by the extension of the process zone in 
both (tangential and longitudinal) directions where the multiple blow-up structures 
(daughter cracks) and, as a consequence, the main crack branching appear. The last 
situation is qualitatively similar to the resonance excitation of numerous mirror zones 
under the spall condition but in the nonuniform stress field in the range of angle with 

G > Gc. 

4.6.4. Experimental study of nonlinear crack dynamics 

Experimental setup Direct experimental study of crack dynamics in the preloaded 
PMMA plane specimen was carried out with the use of a high speed digital camera 
Remix REM 100-8 (time lag between pictures lOfis) coupled with photoelasticity 
method (Figure 6) [29, 30, 31]. 

The pictures of stress distribution at the crack tip are shown in Figure 7 for 
slow (V < Vc) and fast (V > Vc) cracks. The experiment revealed that the path of 
the critical velocity Vc is accompanied by the appearance of a stress wave pattern 
produced by the daughter crack growth in the process zone. Independent estimation 
of the critical velocity from the direct measurement of the crack tip coordinates and 
from the pronounced stress wave Doppler pattern gives a correspondence with the 
Fineberg data (Vc ^ 0.4V/?) [16]. 

Characteristic crack velocity The dependence of crack velocity on the initial 
stress is represented in Figure 8. Three portions with different slopes are shown. The 
existence of these portions determines three characteristic velocities: the velocity of 
the transition from the steady state to the nonmonotonic straight regime V^ 220m/s, 

the transient velocity to the branching regime Vc ^ 330m/s, and the velocity Vb ^ 
600m/s when the branches behave autonomously. 
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Figure 6. Scheme of the experiment. 
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Figure 7. Different regimes of crack dynamics. 
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Figure 8. Crack velocity versus applied stress. 











92 



O. B. Naimark 




Figure 9 . Mirror zone concentration versus applied stress. 



The characteristic velocity Vc ^ 330m/s allowed us to estimate the peak time tc 
to measure the size ofthe mirror zone Lc ^ 0.3mm: /c = ^ ^ 1 x 10“^s. This result 
led also to the explanation of the linear dependence of the branch length on the crack 
velocity [20]. Actually, since the failure time for V > Vc is approximately constant 
(tf ^ tc ^ 1 /xs), there is a unique way to increase the crack velocity to extend the size 
of the process zone. The crack velocity V is linked with the size of the process zone 
Lpzhy the ratio V = In our experiments, the dependence of the density of the 
localized damage zone on the stress was observed (Figure 9). Since the branch length 
is limited by the size of the process zone, we obtain the linear dependence of branch 
length on the crack velocity. This fact explains the sharp dependence (quadratic law) 
of the energy dissipation on the crack velocity established in [ 20 ]. 

4.6.5. Scaling properties of failure 

The scaling properties of failure stimulated great interest in the general problem of 
disordered media when the self-affinity of the failure surface was established in terms 
of the universality of the so-called roughness exponent. The self-affinity of the fracture 
surface was established first by Mandelbrot [32] as the existence of the power law 
of the distance r measured within the horizontal plane for the points at which the 
heights h{r) are measured. This defines the surface-roughness index fas h(r) cx r^. 
Experimental data of the past decades on the measurement of the roughness exponent 
revealed the self-affinity of the fracture surface and it was established that the pattern 
of the fracture surface can be considered as scale invariant objects with the roughness 
index f 0.8 ± 0.05 [33]. This fact allowed the determination of the length scales 
r > ro where the roughness exponent is the invariant. The range of scales r > ro 
for many materials shows the universal scaling properties (the roughness exponent 
f « 0.8). For the scales r < ro the roughness exponent can change. The scale 
invariant property for r > ro means the transition from the roughness statistics caused 
by the initial structural heterogeneity (size of blocks, grains) to the statistics given by 
the collective properties of defects under transition from damage to fracture. 
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Figure io. The roughness of the failure surface in PMMA. 




Figure ii. Failure surface for slow V < Vc and fast 
(Vb > V > Vc) cracks. 

In our experiments the roughness profile was determined for the PMMA fracture 
surface (Figure 10) using the laser scanner system. Fracture surface analysis revealed 
the correspondence of the sharp change of the crack dynamics for the velocities Vc 
and Vb and the fractographic pattern. The fractographic image of the fracture surface 
was studied in the velocity range V ~ 300 — 800m/s when different regimes of the 
crack propagation were observed. 

The first regime V ~ 220— 300m/s is characterized by the mirror surface pattern 
(Figure 11). The increase of the crack velocity in this range leads to the characteristic 
pattern on the mirror surface in the form of the so-called conic markings [34]. The 
conic markings are the traces of the junctions of the main crack and the damage 
localization zones that nucleate in the process zone. These data reflect the influence of 
the damage kinetics on the characteristic size of the above mentioned failure structures 
on the surface. 

The second regime appears in the velocity range 300-600m/s and the surface 
pattern includes numerous mirror zones. 

The analysis of the roughness data in terms of the roughness exponent showed 
the dependence of the scaling properties on the regime of crack propagation. However, 
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Figure 12 . Scheme of stress phase portrait recording. 



data have been collected on a group of specimens characterized by the exponent f ^ 
0.8. This fact allowed us to assume the existence of the regime of crack propagation 
with universal scaling index close to f 0.8. The existence of different scaling 
indexes for other regimes of crack propagation reflects the variety of the behavior of 
the investigated nonlinear system. As was shown, the crack dynamics in quasi-brittle 
materials is subject to two attractors. The first attractor is given by the intermediate 
asymptotic solution of the stress distribution at the crack tip. The self-similar solution 
(9) describes the blow-up damage kinetics on the set of spatial scales and determines 
the properties of the second attractor. This attractor controls the system behavior 
for W > Vb when there is a range of angles with a > Gq. The universality of the 
roughness index can be considered also as a property of this attractor. In the transient 
regime Vj? > V > Vc the influence of two attractors can appear; here is a possible 
mechanism for the dispersion of experimentally measured roughness on the scale 
r > ro. 

The scaling properties of failure were studied also under the recording of the 
stress dynamics using the polarization scheme coupled with the laser system (Fig- 
ure 12). 

The temporal history of stress was measured at the marked point away from 
the main crack path at a fixed (4mm) distance. This allowed us to investigate the 
correlation property of the system using the stress phase portrait d ~ or for slow 
and fast cracks (Figure 13). These portraits display the periodic stress dynamics 
(Figure 13(a)) in correspondence with the local ellipticity of (15), for a < otc (V < 
Vc)» and the stochastic dynamics (Figure 13(b)) for V > Vc, corresponding to the 
second type of the attractor. In the transient regime V ^ Vc the coexistence of two 
attractors may appear, leading to the intermittency effect as the possible reason for 
the scaling index dispersion. 
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Figure 13 . The Poincare cross-section for the phase variables d ~ 
a: (a) V = 200m/s. (b) V = 615m/s. 
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Figure 14 . The stress history. 



The recording of the temporal stress history at the marked point for V > Vc 
revealed the appearance of finite amplitude stress fluctuations, which reflect the qual- 
itative new structural changes in the process zone for the fast crack (Figure 14). The 
scaling properties, like the above attractor properties, were studied in terms of the 
correlation integral calculated from the stress phase pattern using the formula [35] 



C(r) = lim --2 ^ 

m-^00 



^ r 



ij=l 



where jc/, xj are the coordinates of the points in the d ~ or space, and //(•) is the 
Heaviside function. The existence of the scales r > ro with the stable correlation index 
was established for the regimes V < VcandV^ > V > Vc (Figure 15). The values of 
the correlation indexes in these regimes show the existence of two scaling regimes with 
the deterministic (V = 200m/s, v ^ 0.8) and stochastic (V = 426, 613m/s, v 0.4) 
dynamics. The extension of the portions with constant indexes determines the scale of 
the process zone Lpz- The length of the process zone increases with the growth of the 
crack velocity in the range Vfi > V > Vc with the preservation of the scaling property 
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Figure 15 . Correlation integral. 



of the dynamic system. Numerical simulation of the damage kinetics in the process 
zone allowed us to conclude that this scaling is the consequence of the subordination of 
the failure kinetics to the blow-up self-similar solution that determines the collective 
behavior of the defect ensemble in the process zone [29, 30, 31]. 



4.6.6. Low velocity limit. Crack arrest 

In this part we address the question of how a dynamic crack may approach zero 
velocity. This phenomenon was discussed in [21] and it was shown (considering a 
simplified version of the strip specimen with the radiation at the boundaries) that a 
steady-state velocity law with a square root behavior is expected as a function of the 
excess load over the Griffith load. This means that the steady-state velocity increases 
with an infinite slope near zero overload but in a smooth fashion otherwise. It was 
shown also that if the crack has no field inertia, the pass over the “trapping limit” will 
lead to crack movement. However, if the crack has a field inertia the crack velocity 
will exhibit a transient oscillation. A similar conclusion can be reached by comparing 
the estimation of the crack velocity given in [ 21 ], 



V = w 



{Ee - 2y) 

C{w) ’ 



where w is the Barenblatt cohesive zone of the crack, {Eg — 2y) is the Griffith static 
term, and C{w) is a material parameter, with the results predicted by the statistical 
model . Since w is similar ioLpz ^ root term is the inverse characteristic time 

tc given by the self-similar solution. This fact allowed us to determine the range of 
the application of this generalized Griffith relation, where the crack can approach rest 
smoothly: V < Vc. For V > Vc, when the “wave part” of the energy increases with 
crack velocity, the crack arrest appears nonsmooth because the energy metastability 
provides “crack trapping.” A similar view can be developed to analyze the crack 
overload above the Griffith value before any state motion. 
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4.7. Resonance excitation of failure 

4.7.1. Delayed failure phenomenon. Failure waves 

The phenomenon of failure wave in brittle materials has been the subject of intensive 
study during the last two decades. The term “failure wave” was introduced in [37] 
as the limit condition in damage evolution when the number of microshears is large 
enough for the determination of a front with a characteristic group velocity. This 
front separates the structured material from the failed area. The important feature 
of the failure wave phenomenon is that the velocity of failure wave doesn’t depend 
on the velocity of propagation of the single crack having the theoretical limit equal 
to the Rayleigh wave velocity. The stored elastic energy in the material is the main 
factor, which provides the ability of a brittle solid to generate a failure wave. The high 
capacity of elastic energy in the material can be generated under bulk compression 
or under conditions of loading that lead to a state close to the bulk compression, for 
instance, under impact loading. High capacity of elastic energy can be realized by 
the removal of surface defects or structure homogenization. Brittle materials (glasses 
and ceramics) exhibit very high dynamic compressive strength, the Hugoniot elastic 
limits [38]. For glasses with high strength or high latent compressive stress self- 
keeping failure can be observed not only for compression but also for bending and 
tension. 

Rasorenov et al. [39] were the first to observe the phenomenon of delayed fail- 
ure behind an elastic wave in glass. Such a wave was introduced in [40] where the 
concept of a wave of fracture was discussed to explain the nature of the elastic limit. 
The existence of a failure wave was established by considering a small recompres- 
sion signal in the VISAR record of the free surface velocity of K19 glass (similar 
to sodalime). This recompression signal resulted from a release returning after re- 
flection of the shock at the glass rear surface reflecting again in compression at the 
lower impedance failure front. Recent studies have suggested that a wave of failure 
propagates behind the elastic wave in glass with a velocity in the range 1. 5-2.5 km/s. 
Failure wave appeared in shocked brittle materials (glasses, ceramics) as a particu- 
lar failure mode in which they lose strength behind a propagating front. Generally, 
interest in the failure wave phenomenon originates from the still open problem of 
the physical interpretation of traditionally used material characteristics such as the 
Hugoniot elastic limits, dynamic strength, and the relaxation mechanism of the elastic 
precursor. 

Recent research has shown that glasses having open structures may fail in a 
characteristic way when loaded with plane shock waves. More filled materials such 
as sodalime glass have an elastic response to 6 or 7 GPa and plastic behavior be- 
yond. The experiments in [41] confirmed the existence of these waves and extended 
measurements to glass states behind the shock (but ahead of the failure front) and 
behind the failure wave using manganin stress gauges. The tensile (spall) strength of 
the glass remained high behind the shock but dropped to zero behind the failure front. 
Additionally, the shear strength of the material dropped from a high value ahead, be- 
coming lower behind the failure front. The experimental study established the sharp 
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light opacity behind a front under high speed videocamera recording [42], reduction 
in acoustic impedance, and lowered sound speed [39]. The material states behind and 
ahead of the moving boundary (behind the shock) allowed the measurement of failure 
wave speed [43]. Comparison of these data with high speed framing in sodalime and 
borosilicate (Pyrex) glasses [38, 44] revealed the constant value of the failure wave 
speed. 

Qualitative changes in silicate glasses behind the failure wave through the mea- 
sure of the refractive index (refractive index increased) allowed Gibbons and Ahrens 
[45] to qualify this effect as a structural phase transformation. These results stimulated 
Clifton [46] to propose the phenomenological model in which the failure front was 
assumed to be a propagating phase boundary. According to this model the mechanism 
of failure wave nucleation and propagation results from local densification followed 
by shear failure around inhomogeneities triggered by the shock. 

The description of the failure wave phenomenon as a consequence of the gen- 
eration of collective burst modes of mesodefects was proposed in [47, 48, 49] in the 
course of a study of nonequilibrium transitions in defect ensembles. It was shown 
that a self-similar solution for the microshear density tensor exists that describes the 
qualitative changes in the microshear density kinetics during nonequilibrium transi- 
tions. The failure waves represent the specific dissipative structures (the “blow-up” 
dissipative structures) in the microshear ensemble that could be excited due to the 
transit of the elastic wave. 



4.7.2. Self-similar solution 



Equation (15) describes the characteristic stages of damage evolution. In the range 
of stress a < Oc and defect density p < Pc the damage kinetics is subject to the 
“thermodynamic branch” oa corresponding to the local minimum of the free energy 
(Figure 1). When the stress approaches the critical value Oc (p Pc) the properties 
of the kinetic equation (15) change qualitatively (from elliptic to parabolic) and the 
damage kinetics is subject to specific spatial-temporal structures that appear in the 
defect ensemble due to the interaction between defects [12]. These structures describe 
damage localization and nucleation of failure hotspots. 

The spatial-temporal structures are given by the self-similar solution (20) of the 
kinetic equation (15) when passing the critical point pc. The substitution of (20) into 
(14) leads to the equation for /(^) 



1 ^-(g,+ l) rf/ d y df] . 

i8-r 2(^-1) ^ d^l’ d^\ ■’ 



(25) 



The conditions for / at the front of the dissipative structure and the symmetry 

conditions at the center of the structure are 



? = f = 0, r^=0, 

fCO^ = 0 . 

^ d^ 



(26) 



^ = 0 : 



(27) 
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Figure i6. Fracture time tf for (1) a shocked rod of PMMA and 
(2) ultraporcelain versus stress amplitude Gq . Inset: Surface pattern 
with mirror zones in different spall cross-sections. 

We assume that the defect density in the dissipative structure exceeds essentially 
this density outside the failure hotspot. The difference in the number of conditions 
(26) and (27) and the differential power of (25) means that the solution exists only 
for specific values of ^/ = i.e., the eigenvalue problem for arises. A method 

for the solution of this problem was developed in [13], and it allowed the estimation 
of 1/ and the definition of failure front propagation 

— i. 0) <y+l 

(28) 

Equation (28) determines three self-similar regimes depending on the relations 
between the parameters of the nonlinear medium. If the material properties and the 
stress levels provide the damage localization kinetics with parameters p > co + 1 the 
failure wave front will propagate with a group velocity given by the solution (28). 

4.7.3. Some experimental results 

The blow-up kinetics of damage localization allowed us to link the hotspots of fail- 
ure with the above mentioned self-similar structures. The correspondence of these 
hotspots having the image of mirror zones to the above self-similar structures was 
confirmed experimentally in [50, 51]. The multiple mirror zones with equal sizes 
were excited on different spall cross sections in the shocked PMMA cylindrical rod 
when the stress wave amplitude exceeded some critical value corresponding to the 
transition to the so-called “dynamic branch” under spalling (Figure 16). The constant 
size of damage localization corresponds to the damage kinetics (free energy release 
rate) given by the following relationship between nonlinearity parameters p ^ 

The “dynamic branch” corresponds to the stress a > Gc, where the failure 
scenario is determined by the generation of the collective mode in the defect ensemble 
in the form of a dissipative structure with blow-up damage kinetics. The self-keeping 
features of failure corresponding to the conditions of failure waves were also observed 
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Figure 17. Propagation of stress (S) and failure (F) waves. 



in the conditions of dynamic crack propagation. The framing of crack dynamics in the 
preloaded PMMA plane specimen established the existence of the transient velocity 
600m/s, when the branches behave autonomously (Figure 8). The low rise of 
velocity for a > 60MPa reflects the stress independent character of failure similar to 
the dynamic branch in spall fracture. The pictures of stress distribution at the crack tip 
is shown in Figure 11 for slow (V < Vc), fast (V > Vc), and branching (V > Vb) 
cracks. 

4.7.4. Simulation of failure waves 

The study of failure wave initiation and propagation was carried out on the basis of the 
constitutive equation (28) coupled with the momentum transfer equation. The defect 
density tensor in the compression stress wave represents the microshear density. 

The system of equations was solved using an original finite element code. The 
simulation confirmed the delayed propagation of the failure front behind the stress 
wave (Figure 17). We observed also that the shear stress vanishes in the failure wave. 
The failure wave propagation leads to qualitative change in the transverse stress when 
this stress is approaching the longitudinal stress level (Figure 18). 

4.7.5. Statistics of fragmentation 

Qualitative new features in crack dynamics (transition from the steady-state branching 
regimes and the fragmentation) such as the phenomenon of delayed failure revealed 
specific features of dynamic failure that are properties of nonequilibrium systems 
with nonlinear and stochastic behavior caused by the collective effects in the defect 
ensemble. This fact allows us to develop a view on the dynamic fracture statistics 
(fragmentation problem) that links the nonlinear dynamic aspects of damage evolu- 
tion with the change of symmetry properties in the system due to the generation of 
collective modes in the defect ensemble. 

Some basic theories have emerged within the past 10 years for predicting the 
consequences of dynamic fragmentation induced by high velocity impact or explosive 
events. These theories have focused on the prediction of mean fragment size through 
energy and momentum balance principles [52, 55] and on statistical issues of fragment 
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Figure i8. Longitudinal Gxx and transverse Gyy stress kinetics. 



size distribution [56, 57]. However, there are a number of unresolved issues within 
the development of statistical energy balance theories of fragmentation. 

Dynamic fragmentation was analyzed in [58] on the basis of local energy in- 
equality and minimum fracture time requirement. This theory was later modified in 
[54] to include the kinetic energy available for fragmentation. However, the theories 
are not complete because the evolution process that leads to catastrophic failure of 
the materials was not considered. 

Taking into account the dramatic changes in the scaling properties of the non- 
linear system “solid with defects” due to the re-subjection of the system behavior to 
the collective modes of defects, the following scenario of the dynamic fragmentation 
can be discussed. The existence of characteristic stages of dynamic fracture under 
crack propagation (Figure 8) allowed us to establish the correlation of the dynamic 
fragmentation statistics with the energy density imparted on the material. The en- 
ergy density E < Ec (Ec corresponds to the critical velocity Vc of the steady-state 
branching transition) provides the stress intensity controlled failure scenario. The 
transient densities Eb > E > Ec (Vc < V < Vb) lead to the fracture statistics that 
are sensitive to both self-similar solutions: the self-similar stress distribution at the 
crack tip and collective blow-up modes of damage localization. The intermittency 
effect, when the system reveals the complex statistics in the presence of two attrac- 
tors, probably, has a phenomenological interpretation in the framework of Weibull 
statistics. Taking into account the theoretically predicted low limit of damage local- 
ization scale Lc, it can be assumed that a critical energy density exists that provides 
the limit size of fragmented structure close to Lc- Since these energy densities can 
be imparted in materials generally by shock, this homogeneous fragmentation can 
be realized because of failure wave excitation. The experiments in [59] with large 
specimens (optical glass K5 impacted by blunt steel cylinder) visualized the several 
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fracture nucleations in the form of failure waves. Several spherical or nearly spherical 
waves were formed ahead of the main front. 



4.8. Plasticity as metastable orientation transition in microshear 
ensemble 

4.8.1. Defect induced mechanisms of momentum transfer 

The physical particularity of momentum transfer under plastic flow is the motion 
of the deformation carriers (microshears in our case) in the conservative (elastic) 
field. This fact reflects the principal difference of irreversible deformation caused by 
the dislocation structure rearrangement from conventionally known mechanisms of 
viscous flow in liquids, which occurs due to momentum diffusion. 

Much progress was achieved during the last decades in the understanding of the 
mechanisms of plastic flow and the phenomenology of plasticity for the large range 
of the stress and strain rate intensities. However, the attempt to use the structural 
aspects in the formulation of the plasticity constitutive equations did not allow the 
explanation of most main questions related to the specific nature of deformation 
carriers responsible for the plastic flow. Still unresolved problems are the regularities 
of the strain localization due to shear banding, the linkage of the structure evolution 
with deformation hardening, and the explanation of the structure of the plastic wave 
front in shocked materials. 

The experimental data concerning the transitions in dislocation substructures 
under the plastic flow and the results of the statistical description of collective behavior 
of microshear ensemble allowed one to link plasticity rules with structure evolution. 
It was observed in Section 4.2 that the intermediate range of structural heterogeneity 
(for instance, grain size) corresponds to values of the structure sensitive parameter 
Sc < S < 5* and provides the deformation response ofsolids caused by the generation 
of collective modes of the microshear ensemble with pronounced orientation. The 
formation of these modes is the consequence of the orientation instability in the 
microshear ensemble within the corresponding range of external stresses < Gy < 

o'HEL for the given structural parameter 8 (Figure 19). These stresses determine the 
range of the yield stresses (cfy , aHEL)> where the orientation metastability occurs. The 
yield stresses from this range represent the sum of the stresses cr^ (where orientation 
metastability can appear). This stress has an athermal nature and determines the 
initial point of availability of the coherent microshear behavior as a qualitative new 
mechanism of stress relaxation. Part of the yield stress (a^ - ap determines the 
additional stress, which provides the thermally activated transition A D in the 
dislocation substructure in the more orientated state D. 

The jump in the microshear density parameter Apad leads (according to (16)) 
to the sharp drop of elastic modulus, which precedes the plastic flow. The possible 
scenario of the microshear ensemble evolution at the point D is shown in Figure 1 9 and 
can involve two opportunities. The first one, along the Bd trajectory, here supposes 
high deformation hardening. The second scenario (along the trajectories DHE ox 
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Figure 19 . Metastable orientation transition in microshear ensemble. 




Figure 20 . Free energy “transitions.” 



D F) presumes an increase of deformation caused by the defects in the continuous 
orientation ordering in the course of metastable transitions due to the change of value 
of the structure sensitive parameter 8. Two realizations of this scenario (DE and DF) 
correspond to the typical plastic deformation of solids that are conventionally known 
as perfect plasticity (DE) and plasticity with a hardening (DF). 

This type of plastic flow is described in the framework of the incremental plas- 
ticity theory, when the increment of plastic deformation d£fj^(= dpik) is determined 
by the so-called normality rule 

= (29) 

daik 

where 0 is the plastic potential and X is the multiplier. The incremental law of plastic 
flow determines the plastic strain decrement as the result of the “expansion” of the 
yield surface, given by the plastic potential 
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The consideration of the plastic flow as nonequilibrium orientation transition 
led to the interpretation of the phenomenological background of the incremental 
plasticity. According to this result the driving force of the plastic deformation is the 
minimization of the free energy F (a, p, 8) in the current system state, for instance, 
(o'D» under the continuous ordering of defects (D -> // -> • • • ) and the 

formation of different dislocation substructures. The description of these transitions 
can be obtained as the generalization of (14) considering <5 as the independent variable 

SF = ^dp + ^dS < 0. (30) 



It is evident that the increment dS plays the role of the increment dX and provides 
the change of the plastic flow potential in term of 8. For the quasi-static loading the 
relations (30) allowed the expression of the increment d8 in the term of current values 
of stress and defect induced plastic strain p. The unloading, for instance, from point 
F occurs along the trajectory FES in the “partly unloaded state.” 

Kinetics of the defect density and the scaling parameter can be described with 
the usage of the evolution inequality in the form 



8F dFdp 8Fd8 ^ 

— — _l_ < 0. 

8t dp dt d8 dt 



(31) 



The analogue of Oy under unloading is the threshold stress a^. Taking into ac- 
count that the plastic flow realizes as continuous orientation ordering of dislocation 
substructures with the scaling parameter 8 related to the size of the dislocation sub- 
structure In and the distance between ones Ic, the real scenario of the plastic straining 
depends on the loading conditions, for instance, the stress or the strain rate. 

The strain rate sensitivity of plastic flow is observed typically at strain rates 
exceeding 10^s“^ It is important to use in this range of strain rate e > 10^s“^ 
the description of independent kinetics of defect induced plastic straining and the 
structure evolution in the form that follows from (31), 



dp dF 

dt ^ dp' 

d8 dF 



(32) 

(33) 



where Lp and Ls are the kinetic coefficients. The relations between the kinetics 
of the external loading (for instance, imposed stress or strain rate), the kinetics of 
transition in the p metastability area, and the kinetics of structure sensitive parameter 
8 determine the variety of the material behavior for high strain rate loading and the 
response of shocked materials. 

The kinetics of (5 -parameter determines the hardening law due to the formation 
of new dislocation substructures responsible for the momentum transfer in more high 
structural level that is realized as the rescaling of initial structural scales In Ln(p), 

Experimental data concerning the variation of the flow stress with a strain rate 
established the dramatic increase of the flow stress at the strain rate k > 10^ — lO'^s”^ . 
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The description of the plastic flow as a metastable orientation transition in the defect 
ensemble allowed us to propose the explanation of the anomalous hardening. Equation 
(34) can be represented in the form 



dS 

dt rs dS ’ 



(34) 



where F — vs = (LsA) ^ is the characteristic time of structure rearrangement 
in terms of 8. The anomalous hardening for the strain rate e > 10^ — iC^s”^ is 
the consequence of limiting rate ability of materials to the dislocation substructure 
rearrangement with characteristic time xs ^ 10"^s. For strain rates exceeding t > 
lO'^s"^ the plasticity, as a metastable transition to the dislocation substructure with 
more pronounced defect orientation on large spatial scales, could not be realized and 
defect ensemble evolution could occur along the branch CDD'd with a constant value 
of 8 and maximal hardening. 



4.9. Kinetics of metastable transitions and mechanical responses 
of shocked condensed matter 

4.9.1. structure of stress wave in solid under impact loading 

Complicate material responses to the increase of the strain rate are observed in the 
nonlinear behavior of deformation and in the changes of yield stresses of plasticity 
and strength. The attempts were undertaken in [47, 50, 51] to establish a link of 
the mesodefect evolution (microcracks, microshears) with relaxation properties and 
failure kinetics. A statistical approach allowed us to establish specific features of the 
defect ensemble evolution caused by the initial solid state (structural heterogeneity 
in terms of 8) and the interaction between defects. The scenario of defect ensemble 
evolution has the form of nonequilibrium kinetic transitions, which appear as a specific 
form of self-similarity. This self-similarity is particularly clear at plastic instability 
and damage localization under dynamic loading. The self-similarity in the behavior 
of a solid loaded dynamically is caused by the excitation of spatial-time structures 
in the defect ensemble. The appearance of these structures is accompanied by the 
qualitative change of solid responses to dynamic loading. 

The established correspondence of the nonlinear behavior of the defect ensemble 
and the structure of the shock wave profile leads to the explanation of some regularity 
of shocked material responses. 

One-dimensional plane shock waves have been widely used to study the me- 
chanical response of solids to high velocity deformation. For many materials under a 
certain range of impact pressures, there exist two- wave structures of which the first 
wave, the so-called elastic precursor, travels at the velocity of sound waves, while 
the second wave, the plastic shock wave, travels at a slower speed that increases with 
impact pressure. While the full two- wave structure is thus not steady (does not prop- 
agate without change of form), each component wave may be treated as steady after 
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sufficient propagation distance. The steady-wave profile propagates without change 
in shape. 

The stress wave structure was examined in [60] for the impact loading of an 
aluminum plate loaded by the quartz disk at the rate of 400m/s. In the case of the plane 
wave propagation in the z direction the system of constitutive equations coupled with 
conservation laws of mass and impulse are written as 



^p'zz 




9po _ 
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dt 


’’^Pzz 


dt 


^ dpo 
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= + Pzz^ 
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(35) 

(36) 

(37) 

(38) 



where and po are deviatoric and isotropic parts of p^^, F = -f 

F^(po, cro), p is the material density, and L'p and are the kinetic coefficients. 
Only the p'^ component is nonnull in compression waves. 

The solution of system (35)-(38) has to satisfy the boundary and initial condi- 
tions 



o^zz(0, 0 = oro(0» t) = 0, 

Mz. 0) = a(z, 0) = pzz(z, 0) = 0, p(z, 0) = po, 

where h is the plate thickness, andao(0 was determined on the basis of the so- 
lution of the collision problem. Material parameters have been determined from 
quasi-static data of aluminum testing (uniaxial tension) using direct methods of 
the registration of the microcrack accumulation. The parameters were taken to be 
p = 2.71 X lO^kg/m^ G = 109.7GPa, Tp = Lp/G = 2.1 x IQ-^s, r/ = h/Ci = 
1.96 X 10“^s^Q = {G/p)^!^). Results of a numerical simulation of the stress wave 
propagation and the time dependence of stress and microcrack density parameter in 
the spall section are presented in Figure 21. In the stress area corresponding approxi- 
mately to the dynamic yield stress the orientation kinetic transition for the parameter 
p^z occurs and results in the abrupt increase in stress relaxation tempo, a change in 
the plastic wave profile and the separation of the elastic precursor (bold parts of the 
curves in Figure 21). 

A sharp transition to the highly ordered structure due to the metastable transition 
may lead to the behavior that has been commonly referred to as dynamic failure due 
to plastic shear instability. Spatial scales of the orientation area (shear localization 
zone) are determined by the parameters of the solitary wave of shear instability given 
by the solution (18). 
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Figure 21 . Structure of stress waves and damage kinetics in spall 
cross section of aluminum target. 



4.9.2. Strain rate universality at the steady-state plastic wave 

Experimental evidence of viscous-like effects in solids was lacking until the results 
of Sakharov’s group [61] when the measure of shock viscosity in solids was inferred 
using an indirect method. The existence of steady-wave shocks as significant fea- 
tures of stress waves in solids is treated in [62, 63]. With the development of laser 
interferometry techniques the first direct measurements of high resolution wave pro- 
files were provided in [64] for aluminum. The observation of the shock wave rise 
showed the very rapid increase in steady- wave strain rate with increasing peak stress. 
This increase is substantially greater than would be predicted by a simple Newtonian 
proportionality between viscous stress and strain rate. The existence of steady- wave 
solutions for the Navier-Stokes equations was recognized first by Rayleigh and Tay- 
lor [65, 66]. Since then a large literature on this subject in the field of fluids has 
emerged [67, 68]. 

The unique feature of the large amplitude wave profile is the steady-wave plastic 
shock profile. The steady-wave profile propagates without change in form and, as 
pointed out in [69], is a consequence of a stable balance between the competing 
processes of stress-strain nonlinearity and dissipative or viscous material behavior. 

According to the results presented in Section 4.5, the steady-wave profile for 
different shock amplitudes appears as that of the self-similar auto-solitary wave in- 
duced by the collective orientation mode in the microshear ensemble. The rescaling 
of current values of S (DF-path, Figure 19) provides the self-similar auto-solitary 
response for the variation of the stress amplitude. 

The rate of the transition from the lower to upper branches reaches the max- 

imum under the deepest penetration into the metastability area (point b in Figure 19). 
This is assumed to be a main reason for the universality of the plastic strain rate 
dependence on the stress amplitude Pzz ^ established by Barker and Grady 

for a wide class of materials for strain rates s > 10^s“^ This relation follows from 
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the self- similar solution (18), when the driving force of the transition under the S 
rescaling can be represented in terms of the stress as the difference in the free energy 
for the current level along the t/G-path (Figure 19) and the energy for metastability 
limits (the analogue point b, Figure 19) for the curves corresponding to the current 
values of 8. 

This result reflects the importance of the collective effects (orientation transi- 
tion) in the system of defects providing the intriguing material independent feature 
in the steady-shock process and reveals the specific character of the solid viscous 
response. On the other hand the universality of this fourth-power dependence for a 
large class of materials leads to the assumption that in the intensive shock environ- 
ment the complex structural processes (which are involved due to the plastic flow) 
become simpler. The study of these general mechanisms seems to be important for 
the understanding of the collective effects in the defect ensembles responsible for the 
irreversible deformation in solids. 

4.9.3. Relaxation properties and defect-induced instabilities in shocked liquids 

The problem of the true structure of liquids can not at present be considered as 
finally explained. A dependence of mechanical properties on the relation between the 
relaxation time of the medium and the characteristic time of the loading is not only a 
property of solids but is also valid for liquids. All theories of viscous liquid flow start 
out to some extent with an assumption about the nature of the structure of the liquid. 
For this reason investigation of the relaxation properties of a liquid under high strain 
rate loading produced by shock waves, is of great scientific and applied interest. The 
viscosity of condensed matter plays the principal role in effecting the irreversible 
shock compression. It is well known that, if account is not taken of viscosity, one can 
not produce a continuous distribution of all thermodynamic quantities in the shock 
wave front (SWF). 

The method of investigating the viscosity of materials behind the SWF, proposed 
first by Sakharov [61], is based on the experimental study of small perturbations on 
the SWF and then by Barker [64] with the use of the Doppler interferometry (VISAR) 
technique. 

A still unresolved puzzle arose in Sakharov’s experiments when the shocked 
liquid (water and mercury) and solid (aluminum and lead) revealed the same vis- 
cosity r] = lO^Pz at approximately the same values of the strain rate s = 10^s~^ 
at the SWF. It is appropriate to emphasize that these substances are very different 
under normal conditions. Nevertheless, for the pressure P ^ 80~100KBar water and 
mercury are characterized by the same dynamic viscosity as that of an unmelted solid 
(pressure range P ^ 4(X)-5()0KBar) behind SWF. One is led to the conclusion that 
the mentioned liquids in the pressure range P ^ 80-100KBar have a relaxation time 
r > ~ 10~^s differing by six orders of magnitude from the molecular (diffusion) 

relaxation times estimated by the Einstein formula as T£> = ~ 10“^^s, where 

A is the distance between the particles and Dsd is the self-diffusion coefficient. It 
means that a liquid behaves in the quoted experiments effectively like a solid. 
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The physical mechanisms leading to the development of instabilities in con- 
densed media indicate the possibility of describing instabilities in liquids on the basis 
of an analysis of the kinetics of fluctuations, if the latter are viewed as defects in the 
structure of liquids [12, 70]. In the case of liquids the mesoscopic defects, being by 
their nature similar to the fluctuation of the displacement field in solids, can be also re- 
garded as real structural defects that are produced during collective motion of groups 
of molecules. This mechanism of motion does not correspond to the conventional 
diffusion mechanism of momentum transfer in simple liquids. 

The attempt to explain the noncontradictory nature of the viscosity of shock 
compressed liquid and solid for the strain rate e = 10^s“^ was undertaken in [61] 
using the Fraenkel’s idea concerning the role of defects (“holes”) produced by the 
propagating SWF. The validity of this view was qualitatively confirmed by the mea- 
sure of the electric conductivity of water behind the SWF, when a dramatic change 
of the resistivity was found. 

Apparently, Fraenkel [71] was the first to call attention to the analogy in the 
mechanisms of flow of solids and liquids, noting that “ . . . x-ray diffraction pictures 
of liquids are similar to those of microcrystalline solids, and it would be possible to 
interpret their general features on the basis of the idea that a liquid consists of a large 
number of randomly oriented submicroscopic-size crystals” and “ . . . the widely held 
view that the fluidity of liquids is caused by the absence of shear elasticity, i.e., by 
a zero shear modulus ... is incorrect (except, possibly, for the case of liquid helium 
II).” These statements are confirmed in [72] by measurements of relaxation spectra in 
a shear simple liquid flow accompanying the superposition of shear oscillations in a 
liquid at frequencies lO^Hz when the existence of shear elasticity was established. The 
explanation of this long relaxation time anomaly is linked in [72] with a coordinated 
displacement and reorientation of groups of molecules which involves longer times. 
A coordinated displacement of groups of molecules (similarly to the relative slipping 
of blocks or grains in solids) can be realized during nucleation of mesoscopic defects 
arising between these groups of molecules. 

The dissipative function for a medium relaxing by flow and as a result of the 
development of defects has the form [12] 

1 SF 

TPs - -—Qk^kT + ~ 1 — Pik > 0, (39) 

T dpik 

whereTis the temperature, qk is heat flux, 8F/8pik is the thermodynamic force acting 
on a system when pik is different from its equilibrium value, and = dk — Pik is 
the “viscous” component of the strain rate tensor. The condition that the dissipative 
function be positive definite leads to a system of equations for the tensor variables: 



Oik = neJk + XPik, 



(40) 



( 41 ) 
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where ry, x, and g are kinetic coefficients. For the case of simple shear, from (40) 
follows an expression for the effective viscosity 

nim = — = tl - in - x)-r-- (42) 

exz ^xz 

In the interval ^ the strain rate fluctuations “conform” to the 

spectrum of solitary waves, meaning that Sik ^ Pik^ and (42) gives the asymptotic 
viscosity rj = x- The independence of viscosity of condensed matter r] ^ 10^ Pz 
for the strain rates f 10^ — 10^s“^ is the consequence of the dependence of the 
deformation response on the strain rates induced by the defect kinetics in the form of 
finite amplitude solitary wave disturbances. 

The following scenario of the development of instabilities in a liquid is possible 
in accordance with the characteristic features of the nonlinear behavior of ensembles 
of defects. 

The weak periodic pulsations of the velocity caused by defect-induced fluctu- 
ations of the strain rates appear in the region 5 > 5*. The flow intensification can 
lead to the transition through the bifurcation point 5* (the region 8c <8 < 8^) and 
the nucleation of the cascade of solitary waves as a result of orientation transition 
in an ensemble of microscopic shears. The inertial cascade of the solitary waves is 
generated under the rescaling of 8 due to the appearance of new spatial scales linked 
with the solitary wave front. The second inertial interval appears during the transition 
for 8 = 8c, when dissipative structures with blow-up kinetics of the defect-induced 
strain rate are excited. 

It is interesting to compare the real picture of turbulent flow with the scenario 
given by the dynamics of pik . The dynamics of the development of turbulence has been 
investigated in [73] in an analysis of the evolution of a turbulent spot in a Poiseuille 
flow with Reynolds number R ^ 840-1500. It is observed that the turbulent spot has 
the form of a triangular wing, and the turbulent motion arises inside a region on the 
boundary of which disturbances in the form of solitary waves arise. These waves, 
generated on the boundary of the spot, propagate into the laminar zone and transform 
into quasi-periodic damped pulsations of the velocity. The motion of solitary waves 
in the direction of the spot results in secondary instability and growth of the spot. This 
experimentally observed scenario agrees with the nonlinear dynamics of macroscopic 
defect-induced strain rate fluctuations. The laws established for the development of 
instabilities in condensed media suggest the possible existence of a bifurcation tree, 
leading to turbulence [74] , that can be described on the basis of a kinetic approach [75] . 



4.10. Summary 

A multifield statistical approach is presented and developed for the study of the multi- 
scale nature of plastic deformation and failure. The tensor order parameter, the defect 
density tensor, is introduced for typical mesoscopic defects, microcracks and micros- 
hears, which are the structural images of dislocation substructures at different material 
scales. 
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This statistical model leads to the existence of a dimensionless structural param- 
eter of scaling, which provides the link between the continuous growth of the defect 
density under deformation and the rescaling of the structural level involved in the 
successive mechanism of momentum transfer due to the collective motion of defects. 
This parameter represents the ratio of the current scale of dislocation substructure 
and the distance between substructures. The statistical model reveals the variety in 
behavior of ensembles of mesoscopic defects caused by the interaction of defects 
among themselves and with an external field. The collective properties of defects 
due to their interaction play a most important role in the formation of large-scale 
mesodefect substructures and finally in the mechanisms of plastic flow and failure. 

Large-scale mesodefect substructures are generated as orientation transitions in 
the low scale mesodefect substructure. This transition occurs during the excitation in 
the material of the auto-solitary strain waves with a front length corresponding to the 
orientation transition area. The important consequence of this result is that the plastic 
flow develops as the continuous orientation ordering of dislocation substructures. The 
driving force of this transition is the free energy release under conditions of growth 
of the defect density and the rescaling of the structural level of the material, which 
provides the optimal mechanism of momentum transfer. 

The orientation transitions in mesodefect ensembles and the nucleation of spa- 
tial areas with pronounced orientation of defects can be considered as mechanisms of 
adiabatic shear bands and of strain localization. The spatial distribution of the initial 
susceptibility in terms of the above scaling parameter can lead to the multiple gen- 
eration of the set of collective modes (auto-solitary strain waves) in the bulk of the 
specimen. These modes can be considered as new phase variables that determine the 
material behavior and decrease the system symmetry. The irregular character of plas- 
tic deformation (Henky flow) is the consequence of the stochastic dynamics due to 
the generation and the interaction of auto-solitary collective modes in the mesodefect 
system. 

The important result of statistical multifield theory is the determination of the 
type of bifurcation that leads to qualitative change for the evolution equation of the 
defect density tensor during transition from plastic mechanism of structural relax- 
ation to the damage localization and failure. This bifurcation type arises for a criti- 
cal value of scaling parameter and leads to the generation of the collective modes, 
i.e., the dissipative structures with blow-up kinetics of the defect density growth on 
the corresponding spatial scales. The set of these collective modes is bounded with 
the coordinates of the second attractor, which defines another multiscale nature of 
damage-failure transition. 

The collective mesoscopic modes of dislocation substructures (auto-solitary 
orientation waves and blow-up dissipative structures) have the meaning of self-similar 
solutions-eigenfunction spectrum of corresponding nonlinear problems. The loading 
condition can provide the resonance excitation of this type of collective modes, which 
may lead to the anomalous deformation responses in the form of specific types of strain 
rate localization, failure waves, and self-similar structures of wave fronts in shocked 
condensed matter. 
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5. Extended Thermodynamics: 

A Multifield Theory Par Exellence 

I. Muller* 



Abstract. A monatomic ideal gas, upon close inspection, reveals itself as a com- 
plex material whose proper description requires the use of hundreds, or even 
thousands of fields, at least in the rarefied state. 



5.1. Introduction 

Nothing should be simpler than a monatomic ideal gas, or so one might think, and that 
intuitive feeling is largely confirmed by thermodynamic calculations in a dense gas. 
Indeed in such a gas the equations of Navier-Stokes-Fourier represent the constitutive 
properties of the gas quite well. However, for low pressures — i.e., at great heights — 
the gases are rarefied and the constitutive equations of Navier-Stokes-Fourier do not 
hold anymore. 

That is the range of extended thermodynamics (see [1]), a multifield theory 
that employs hundreds or thousands of moments to describe the state of the gas. It 
is shown here that only a multifield theory of that type is capable of describing the 
observed properties of rarefied gases satisfactorily. The phenomena considered are 

• light scattering in a gas, 

• shock structure, 

• the development of a shock in a shock tube. 

5.2. Field and field equations 

5.2.1. General 

The fields of extended thermodynamics of monatomic ideal gases are the moments 

P'iih-Jk j (/: = 0, 1, . . . , A^) (1) 

of the distribution function which represents the number density of atoms with a 
certain velocity c. The moments are symmetric tensors of up to order N and, because 
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of the symmetry, the number of independent variables equal 
1 ^ 1 

n = - ^(a + l)(a + 2) = -{N + l)(N + 2){N + 3). (2) 

a=0 

The field equations are of balance type. They relate the rate of change of the densities 
divergence of the fluxes Fuii 2 ...ik productions 

^Fi\i2--ik , _ i-T 

We observe that the fluxes Fifj/ 2 .../^ are themselves moments of rank k 1. This is 
typical for monatomic ideal gases. 



In order to close the system we need constitutive relations for the production 
Ui^i 2 ...ik and for the last flux In extended thermodynamics we achieve closure 

by use of the entropy principle, which determines the distribution function appropriate 
to given variables (k = 1,2,..., N). The distribution function has the form 

N 3 3 

/ = with x = EE ■E A/i/2.../jt^n^*2 * * ‘ (4) 

ifc=i/i=i ik=\ 



where k is the Boltzmann constant and y determines the smallest element dxdc of 
phase space. It is related to the Planck constant. The A's are Lagrange multipliers 
which are introduced in the course of exploiting the entropy inequality. In the end 
they must be eliminated by use of the relations 



y 



CiiCi2 ■ --Ci^ye 



Eh 



E;,=r •• Ei^=i 



(5) 



(In the usual procedure, the integrand in (5) is linearized in nonequilibrium quantities. 
After that the calculation of Ajjij in terms of ./^ is reduced to the solution of 
a linear system of equations.) 

Because of the entropy principle the system of balance equations is symmetric 
hyperbolic — at least if it is written in terms of the Lagrange multipliers A/j/ 2 .../;k — 
therefore all characteristic speeds are finite. It seems appropriate to mention this fact, 
since some of the commonly used field equations for gases are parabolic and predict 
infinite wave speeds. Such is the case for the Navier-Stokes-Fourier equations and 
this phenomenon of infinite wave speeds was called a paradox by Cattaneo [2]. 

One of the motifs for the development of extended thermodynamics was the 
desire to remove that paradox. 



5.2.2. The special case iV = 3 

For A = 3, we have 20 independent equations, which are represented in Figure 1. 
In the figure, the synthetic notation used hitherto has been abandoned and we have 
represented the moments — as far as possible — in their canonical notation: p for the 
density, u/ for velocity, T for temperature, tij for stress, and qi for heat flux. These 
are 13 moments; the remaining six, viz. A and p{ijk)y have no canonical notation. 
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, 5 dxi dxi 

k ' 



-A^t) 
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Figure i . The system of 20 moment equations. Left: Euler equa- 
tions within the frame. Right: Navier-Stokes-Fourier equations 
within the frame. 



nor specific names, other than that they are moments. However, they must satisfy the 
explicit and specific system of partial differential equations. 

Indeed, the left-hand side of the system is entirely specific and the right-hand 
side, here calculated for Maxwellian molecules, contains one single parameter or, 
which can be determined from a single measurement of viscosity or thermal conduc- 
tivity. 

We stop here a moment in order to appreciate the system of 20 equations shown 
in Figure 1 . If we ignore all terms except those in the frame of Figure 1 (left), we recog- 
nize the Euler equations which ignore all dissipation. More realistic are the equations 
of Navier-Stokes-Fourier, which we find within the frame of Figure 1 (right). We 
recognize that, according to that frame, the stress is proportional to the temperature 
gradient. We also recognize, however, that we must leave out entirely specific terms 
in order to obtain the Navier-Stokes-Fourier equations. Such terms are rates and 
gradients of stress and heat flux. 

Therefore we conclude that the theory of Navier-Stokes-Fourier may suffice, if 
rates and gradients are small. On the other hand, extended thermodynamics is needed, 
if rates are fast and gradients are steep. And, since in a gas “fast” and “steep” are 
measured in terms of mean times of free flight and mean free paths, we conclude that 
extended thermodynamics is needed for rarefied gases where flight times and free 
paths are long, while in a dense gas Navier-Stokes-Fourier may be sufficient. 

Wolf Weiss [3] has the explicit moment equations available at the touch of a 
button and he has determined the characteristic speeds predicted by them, i.e., the 
speeds of acceleration waves. The biggest one of them, the pulse speed, is listed in 
Figure 2(left): For n = 20, i.e., the system of Figure 1, the pulse speed is Ma = 1.81, 
while for n = 2300, it is Ma = 6.46, and for n = 15180, it is Ma = 9.36. In 
Figure 2(right), we see from the line identified by crosses that the pulse speed seems 
to increase monotonically, and this was confirmed by Boillat and Ruggeri [4], who 
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No. of moments 


Vma«/Co 
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Figure 2 . Characteristic speeds vs. number of moments. Crosses: 
Exact values by Weiss [3]. Circles: Lower bound by Boillat and 
Ruggeri [4]. 



calculated a lower bound for the pulse speed, viz. 




Thus for infinitely many moments the pulse speed tends to infinity. 

In some way, this result is an anticlimax when we recall that the original motif 
of extended thermodynamics was the desire to obtain finite speeds. We have a theory 
now that predicts a finite speed for all finite numbers of moments and has an infinite 
speed only for infinitely many moments. That seems satisfactory, but for those who 
like the anticlimax aspect better, let me say that by now extended thermodynamics has 
outgrown its former exclusive concern for speeds; it is now an explicit and definite 
theory for the description of rarefied gases and nowhere is this more evident than in 
its treatment of light scattering to which we shall proceed in the next chapter. 



5. Extended Thermodynamics: A Multifield Theory Par Exellence 119 



Before that, however, two remarks seem to be in order. Firstly, in a relativistic 
theory, the lower bound ( 6 ) is not valid. It is replaced by another estimate, also due 
to Boillat and Rugged [5], which implies that the largest pulse speed is the speed of 
light and that may occur only for infinitely many moments. Secondly, it is clear that, 
apart from the pulse speed, a high moment theory predicts many longitudinal speeds; 
indeed, hundreds of them, if the number of moments is in the thousands. Weiss [ 6 ] 
has calculated them all and has plotted them (see Figure 3). Some are below the speed 
of sound but most are above and the spectrum of speeds becomes nearly continuous. 
The question is why do we not see them, or, better, why do we not hear them? We 
shall discuss that question when we deal with shock structures in Section 5.4. The 
short answer is that these propagation modes are quickly damped out. 



^5 




I .■! - mi n... i 

100 200 300 400 



n 



Figure 3 . Pulse speeds. 



5.3. Light scattering 

Light scattering occurs when a light wave with frequency co falls on a gas in equilib- 
rium. It is scattered by the density fluctuations of the gas and the scattered light may 
be investigated in a detector. It turns out that the scattered light consists mostly of 
light of the incident frequency co, but neighbouring frequencies are also represented 
and in a dense gas the spectrum of the scattered gas is represented by a graph with 
three peaks, the central one at the incident frequency (see Figure 4). 

The relative heights of the peaks, their widths and distances permit us to read 
off thermodynamic constitutive properties of the gas, like specific heats, compress- 
ibilities, thermal conductivity, and viscosity. It is now appropriate to ask why it is 
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Figure 4 . Light scattering and the spectrum of scattered light in a dense gas. 



that a microscopic density fluctuation through the light which it scatters can carry 
information about a macroscopic property of the gas, like its viscosity. And this is 
where the Onsager hypothesis comes in. 

Onsager has postulated — and that postulate has been convincingly confirmed by 
experiment — that the mean regression of a microscopic fluctuation may be calculated 
from the macroscopic field equations. 

Once this postulate is accepted, it is easily exploited. What we have to do with 
our explicit set of field equations is this: 

• Fourier transform spatially, 

• Laplace transform in time, 

• solve algebraically, 

• average over initial data. 

In this manner, according to Onsager, we should obtain the scattering spectrum. And 
indeed, for a dense gas, where we know the Navier-Stokes-Fourier equations to be 
good enough, we obtain the three-peaked graph of the experiment. 

However, for even a moderate degree of rarefaction the measured spectrum 
cannot be described by a Navier-Stokes-Fourier theory; in that case we need ex- 
tended thermodynamics. In Figure 5(left), we see the experimental dots, measured 
by Clark [7] and the predictions of extended thermodynamics of 20, 35, 56, and 84 
moments. None of them agrees with the experiments. So now the reader may think 
that it is time for an adjustment of parameters in order to fit the curves. However, 
extended thermodynamics has no adjustable parameters. It is a completely specific 
theory of theories of increasing moment numbers. And the only fitting parameter is 
the number of moments. 

So we increase the number of moments. And indeed for n = 120 we obtain a 
satisfactory agreement with experiment. Moreover, the predicted spectrum of ET 120 
does not change when we increase the number of moments further. Thus in Fig- 
ure 5(right), we see that the curves of ET 120, ET 165, ET 220, and ET 286 all lie on 
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Figure 5. Scattering spectra for different moment numbers. Left: 
ET 20, ET 35, ET 56, ET 84. Right: ET 120, ET 165, ET 220, 
ET 286. 



top of each other — and on the experimental dots! We conclude that there is a conver- 
gence and that convergence allows us to determine the range of validity of the theory. 
As soon as in our theory of theories with increasing numbers of moments one theory 
predicts the same result as the previous one, the latter one was good enough. 

These results are both satisfactory and disappointing. They are satisfactory be- 
cause now for the first time we have succeeded in describing a scattering spectrum in 
a good manner. However, there is also disappointment: To be sure, in view of the ex- 
perimental findings, nobody expected Navier-Stokes-Fourier to be good enough, but 
one might have hoped that ET 13 or ET 14 would suffice. Instead we need hundreds 
of moments. In fact, the numbers between 200 and 300 that were good in Figure 5 
for a moderate degree of rarefaction become insufficient in a more rarefied gas as 
shown in Figure 6. It is true that the disagreement in that case is restricted to a plait- 
like region, but nevertheless it is disagreement. Incidentally, the plait winds around a 
Gaussian curve which reflects the Maxwellian distribution of atomic velocities which 
the scattering spectrum approaches for an increasing degree of rarefaction. 



5.4. Shocks 

5.4.1. Shock structure 

Given that extended thermodynamics is a theory for rapid changes and steep gradients 
it is only natural that shock waves should be investigated by the theory. Also, it is 
well known that the shock structure is badly represented by the Navier-Stokes-Fourier 
theory. Figure 7 shows both the measured shock thickness and the one calculated from 
the Navier-Stokes-Fourier theory as functions of the Mach number. The disagreement 
is obvious. 

This seems to be the occasion for trying out extended thermodynamics. The first 
one to do this was Grad [8], who tried his 13-moment theory. (To be sure. Grad did 
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Figure 6 . Calculated scattering spectra for ET 210, ET 225, 
ET 240, and ET 256. 




Figure 7 . Shock thickness: Experiment and Navier-Stokes- 
Fourier theory. 



not know extended thermodynamics. However, his 13-moment approximation in the 
kinetic theory of gases is essentially equivalent to ET 13.) He found that the theory, 
far from being better, was dramatically worse. Indeed for Ma > 1.65 he found no 
smooth shock at all but rather a subshock (see Figure 8 ). 

It is not certain whether Grad realized the significance of the number Ma = 1 . 65 . 
This speed is the pulse speed of the 13-moment theory. Nothing in the 13-moment 
theory can be faster, and therefore, if the shock moves faster, the information about 
its arrival cannot make itself felt in the upstream region before the shock. Facetiously 
we may say that Ma > 1.65 is the “true” supersonic range for ET 13. 
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Figures. Left: ET 13. Smooth shock structure at Ma = 1.5. Right: 

ET 13. Subshock for M = 2. 

However, this is no problem in extended thermodynamics, at least not in princi- 
ple. If the pulse speed of ET 13 is too small, we choose a theory with more variables 
from our theory of theories. And we already know that the pulse speed goes up with 
the number of moments (see Figure 2). Therefore, Au [9] has calculated the shock 
structures for higher moment theories. The task is infinitely more difficult than the 
corresponding one for light scattering because the nonlinear equations must be found 
and implemented. Yet Au was able to go up to ET 560 and to obtain shock structures 
in excellent agreement with experiments — but only up to Ma = 1.8. Clearly, one 
should go higher with the number of moments, if one wishes to obtain good results 
for higher Mach numbers. At this time, the computing power proved to be insufficient 
for the task. 

5.4.2. Shock tube experiment 

A related problem to the shock structure calculation is the question about how the 
stationary shock structure forms to begin with. Shocks, are usually investigated in a 
shock tube: a long tube initially divided by a membrane into a high pressure region and 
a low pressure one. When the membrane is removed, according to the Euler theory of 
gas dynamics, a shock and a contact discontinuity move into the low pressure region, 
while a continuous but nonsmooth rarefaction “fan” moves into the high pressure 
region. See the dashed lines in Figure 9, which shows columns of graphs for density 
p, pressure p, and velocity v at different times after the removal of the membrane. 

The solid lines in Figure 9 show the calculated results for ET 13. Let us consider 
them: For long times — actually, no longer than 313 mean times of free flight — the 
Euler result is closely approximated, except that 

• the Euler shock is replaced by a continuous and smooth shock structure; 

• the Euler contact discontinuity is replaced by a continuous and smooth struc- 
ture; 

• the nonsmooth rarefaction fan is approximated by a smooth function. 

For short times, however, we see the effect of the two characteristic speeds of ET 13, 
one with Ma = 1.65 and the other one with Ma = 0.63. Accordingly, there are two 







Figure 9 . The emergence of the shock structure in ET 13 and 
according to the Euler equations (dashed lines). 



shocks, a fast and a slow one, propagating to the right and two rarefaction fans, again 
a fast and a slow one. 

The tableau of graphs of Figure 9 shows instructively how the initial situation 
develops, by dispersion and absorption, into the final one. While at the beginning we 
still see the individual waves, we see that, at the end, the shock structure propagates 
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with the Mach number appropriate to the Euler shock and that that Ma number was 
not present initially. 

The situation is perhaps made clearer by Figure 10, which shows calculations 
immediately after the removal of the membrane and for a large number of moments. 
In that case we can also solve the Boltzmann equation and obtain the dashed line. 
The predicted fields of extended thermodynamics show the many waves appropriate 
to a high number of moments, and those fields zigzag around the solution of the 
Boltzmann equation. The zigzag line becomes tighter and tighter as the number of 
moments increases and approximates the exact solution in this manner. 




Figure io. The shock tube fields immediately after removal of the membrane [10]. 

We must realize from the graphs of Figures 9 and 10 that the highest charac- 
teristic speeds are quickly damped. That is the reason why we do not hear them. 
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6. Microstructure and Hirbulence 
in Dilute Polymer Solutions 

E. De Angelis*, C. M. Casciola*, P. M. Mariano^ and R. Piva^ 



Abstract. An appropriate picture of the interaction of polymers chains and turbu- 
lence structure is crucial to grasp the drag-reducing mechanisms of dilute polymers 
solutions. In most models the physically small diffusion is normally neglected. 
However, in the presence of a continuous spectrum of length and time scales, like 
in turbulence, the introduction of a diffusion term, however small, is crucial to 
enforce a cutoff at large wave number. Such a term can also be regarded as a 
natural consequence of a detailed picture of the substructural interactions between 
the polymeric chains and the fluid. The results obtained through numerical simu- 
lations are used in the appropriate thermodynamic framework to extract valuable 
information concerning the interaction between turbulence and microstructure. 
A general multifield formulation is finally employed to explore possible addi- 
tional interaction mechanisms between neighboring populations of polymers that 
may play a role in accounting for slightly nonlocal interactions between polymer 
macromolecules in the solvent. 



6.1. Introduction 

Rows of dilute polymer solutions are highly complex due to the interaction between 
phenomena occurring at the level of the microstructure, i.e., at the level of the macro- 
molecules, with those taking place at hydrodynamical scales. Modeling is already not 
simple for laminar flows. In presence of turbulence the scenario gets worse since the 
smallest hydrodynamical scale shrinks. In fact, even for the largest molecules, the 
interaction between the two levels does not occur in terms of spatial scales, the poly- 
mer being, almost in general, smaller than the smallest scale of turbulence. Temporal 
scales, instead, may easily become comparable when the most striking macroscopic 
effect of the polymers on turbulence occurs in the form of a substantial reduction of 
drag with the flow still remaining turbulent. 

There are typically two different (but strictly connected) approaches to the con- 
struction of a model for polymeric liquids. On the one hand, a micromechanical 
model of the elementary polymer chain may be considered as a basic block to de- 
velop a statistical description of the polymers inseminated in the solvent. Clearly, the 
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achieved level of detail largely affects the range of macroscopic phenomena that can 
be addressed. However, the microrheological complexity should be kept to a mini- 
mum, since easily, the overall description of the solution becomes too cumbersome 
for investigations of turbulence. On the other hand, one may wish to minimize ex- 
plicit assumptions concerning the microdynamics, preserving the capability of the 
model to capture the phenomenology of interest. Since the microstructure plays a 
crucial role, appropriate descriptors of the microstructure must be introduced and one 
is naturally led to a multifield description. In the present paper both these approaches 
are considered. The microrheological description is reviewed first to address effects 
related to the diffusion of the macromolecules. Afterwards, a multifield formalism 
is presented with the physical interpretation of the descriptors naturally emerging 
from the comparison with the microrheological formulation. The purpose is to estab- 
lish the general form of equations suitable to model microstructural interactions of a 
dissipative nature between neighboring material elements. 

Even in the most elementary model of polymer solutions, the thermodynamics 
involved includes irreversible terms that account for the dissipative features of the 
system consistently with the viscoelastic nature of such materials. However, at vari- 
ance with the Newtonian component of the dissipation, the peculiar dissipation of the 
polymers principally acts on a purely local basis. 

For instance, in the simplest dumbbell model, where a polymer chain is modeled 
as a spring connecting two massless beads, the dissipation is localized on the beads 
and is due to the friction in the relative motion of the beads and the supporting fluid. 
Physically, elementary models of this class assume that the spatial scale of variation 
of the microstructural descriptors is so large that no nonequilibrium effects can arise 
due to spatial nonuniformity. 

Neglecting diffusion-like effects due to spatial variations of the microstructure 
is inherently inconsistent and this simplifying procedure can be safely adopted only 
on the basis of an a posteri verification. In fact, in turbulence, it always happens that 
the solution of the system of PDFs, where polymer diffusion processes are neglected, 
blows up via an ultraviolet instability. This makes the simplified system impossible to 
solve numerically, unless stabilization is artificially achieved through the numerical 
scheme. Thus, cross-element interactions are crucial for computational applications 
of polymer modeling in turbulent flows. The present paper is organized as follows. In 
Section 6.2 a brief overview of the elastic dumbbell model is presented. Section 6.4 
is devoted to the description of the effects of turbulence on polymers dynamics by 
discussing a few numerical results. Finally, in Section 6.5, the multifield approach is 
addressed to hint at possible weakly nonlocal interaction mechanisms. 



6.2. The microrheological model 

Approaching the simulation of dilute polymers in a turbulent flow, one has to find a 
correct compromise between the need of a realistic description and the feasibility of 
the simulation. In fact, many degrees of freedom are generally required by turbulence 
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itself and the presence of polymers increases drastically this demand. Specifically, 
we address here the elastic dumbbell model under the assumption of isothermal con- 
ditions, which is one of the simplest reasonable descriptions for the dilute polymer 
solutions. The relevant mechanical model accounts for the basic physical mechanisms 
that characterize the behavior of polymer molecules. Each molecule of polymer can 
be described as a chain of linear elements. In equilibrium conditions, due to thermal 
agitation, the chains assume the aspect of statistically spherical random coils. When a 
substantial deformation rate is present, the coils elongate but naturally tend to return 
to their equilibrium configuration. The restoring force is of an entropic origin. If the 
coils are abruptly released they move back to the equilibrium configuration and their 
dynamics is characterized by a spectrum of relaxation times. 

6.2.1. Langevin and Fokker-Planck equations 

In the rheological model used here, only one relaxation time is retained and each 
molecule is described as a dumbbell, i.e., two massless beads connected by a spring. 
The spring accounts for the “entropic elasticity” of the polymer molecules while the 
viscous forces due to the interaction with the Newtonian solvent are modeled by the 
friction on the beads. The dumbbells are also forced by a Brownian noise that accounts 
for the thermal motion [2]. In order to present the relevant equations as concisely as 
possible, a linear spring is considered. 

The model provides each point in space with a collection of dumbbells, which 
introduces a component of the stress in addition to that of the solvent, the so-called 
extra stress. 

The evolution is controlled by the Langevin equations for the two beads or, 
equivalently, by the Fokker Planck equation for the corresponding probability density 
function (PDF). 

Denoted by xi and X 2 the placements of the two beads, the balance equations 
for the beads read 

f(Xl -Ui) = -H(Xi -X2) + Ii, 

f(X2-U2) = -//(X2-Xl)+|2- 

where f is the friction coefficient, ui and U 2 is the macroscopic velocity field evaluated 
at the bead positions, and and ^2 independent Gaussian white noise processes 
with identical variances = 2kfi0f . // is the spring coefficient, which is typically 
expressed as // = ks is the Boltzmann constant, © is the absolute 

temperature, and n is the number of rigid segments of length b used to model the 
macromolecule as a Kuhn chain. Introducing the end-to-end vector r = X 2 — xi and 
the geometric center Xc = (xi + X 2 )/ 2 , equations (1) are recast in the form 

^(r-Wr) = -2//r + ^^, 

((Xc-Wc) = ^c’ 

where W;. = U 2 - ui, = (ui + U 2 )/ 2 , = h ~ and = (|i + 

^2)/2 and the corresponding variances are = 2or^ and Denoted by 
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xlf(Xc, r, t)d^Xcd^r the probability to find a molecule having extension in the neigh- 
borhood of r with geometric center in the neighborhood of x^, the Fokker— Planck 
equation for V^(Xc, r, t) corresponding to the Langevin equations (2) may be written 
as [6] 



difr 

"a7 



3 

9Xc 




dXc 






dri/ 

Ir 









(3) 



where and depend by definition both on and r. In equation (3) the elastic 
force has been rewritten in terms of its potential V, and the constants Dtr and Drot> 
called translational and rotational dijfusivity, respectively, are related to the friction 
coefficient f and the variances of the stochastic processes equations 



D 



1 



2kS 

~ 



D\x — Diqx/A. 



(4) 



Equation (3) provides the evolution equation for the moments of the PDF and, in 
particular, for the following observables: the number density pp of the dumbbells 
with a given geometric center x^, defined as 



Ppi^cJ) 




the average dumbbell extention at Xc, 



rm(Xc,0 




and the covariance matrix, or conformation tensor. 



n(Xc,t) 




(g) r. 



(5) 

( 6 ) 
(7) 



The evolution equation for the number density given a divergence-free velocity field 
u then follows as 



^Pp , ^Pp 




( 8 ) 



where 



j' = 




(9) 



and w' = Wc — Uc. Uc is the velocity of the fluid at the location of the center of mass. 
Due to the arbitrariness of the numbering of the beads, xj; and should be invariant 
under the transformation r in — r, while W;- is changed into — w^. This implies that 
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Ym = 0. Concerning the conformation tensor it follows that 
SlZij SlZi j 

dt dxk 



= 2DrotP/7^/;‘ + ^irT^rj + T^ir^jr + 

( 10 ) 



where 



C'ij = -d/dXkf (w'^)kfnrj-{- f f (w'^)jirn (11) 

Je3 Je3 Jr^ 



with w' = — Kr and K = Vu. If the typical length scale associated with spatial 

velocity variations is large compared to the dumbbell dimensions, the velocity dif- 
ference between the beads can be expressed as = Kr, which is the first term in 
the Taylor series, and Wc = u^. In this case, the terms j' and C'ij vanish. It is worth 
noticing that, in this approximation, equation (8) for the number density reduces to 
a convection-diffusion equation. Thus, if the initial concentration is homogeneous, 
homogeneity persists in time and equation (8) becomes redundant. On the contrary, 
when the velocity field at dumbbell scale cannot be approximated by a linear dis- 
tribution, migration of dumbbells occurs due to the number density flux j, and the 
concentration is in general not uniform [1]. 

For long chain polymers, the Brownian forcing is typically very small compared 
to the other terms in the Langevin equations, and one may be tempted to neglected 
it in the equation for the geometric center. However, it must be retained, at least in 
principle, in the equation for the end-to-end vector, since otherwise the beads may 
eventually collapse one upon the other. Under these assumptions, all terms multiplied 
by Dtr drop from the system and a deterministic equation for the trajectory of the 
geometric center of the dumbbell is obtained. Consistently the PDF does depend on 
the position only through the velocity field and the associated Fokker-Planck equation 
simplifies to 



dx/f 

dt 



^ d [df , s / vy 
9r [ 9r ^ 8r ). 



- ^(V^Kr), 



( 12 ) 



where the dependency from the geometric center has disappeared and the velocity 
difference is linearly approximated. The corresponding equation for the conformation 
tensor follows from (10) by neglecting C and assuming Pp = const and Dtr = 0. 

Typically, as already anticipated, the translational diffusivity is so small that 
the corresponding diffusive terms are normally safely neglected in laminar flows. In 
turbulence these diffusion-like terms are instead crucial to prevent ultraviolet insta- 
bilities. Actually, turbulent flows are characterized by a continuous flux of energy 
from the low towards the high wave numbers. This effect is related to the meandering 
of vortex lines due to stretching and folding of material lines. The same process oper- 
ates on the conformation tensor, via the term KTZ-^TZK^ , and leads to the formation 
of smaller and smaller scales. In fact, the tiny translational diffusivity is a singular 
perturbation, and eventually, a sufficiently fine scale develops where the cutoff finally 
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starts operating. In numerical simulations, however, the resolution needed to reach 
the dissipative cutoff is never available for physically meaningful choices of the pa- 
rameters, and the diffusion term is usually artificially enhanced as a tool to control 
the high wave number instability [7]. 

As easily verified by direct substitution, for a linear spring, hence a quadratic 
potential V, the solution of equation (3) is a Gaussian distribution and the PDF is 
fully described by its covariance tensor 'R. In the finite elongation nonlinear elastic 
(FENE) model, where the potential is chosen as 






^max 

^max 




(13) 



with r the modulus of r, the Gaussian ansatz does not hold and the solution of the full 
equation for the PDF is required. This procedure is too cumbersome to be used in a 
direct numerical simulation (DNS) of wall turbulence. Instead a closure assumption, 
known as Peterlin’s approximation, is adopted. This allows one to account for finite 
extensibility without substantially increasing the complexity of the model with respect 
to the case of linear elasticity. 



6.2.2. Evolution equation for the conformation tensor 

Peterlin’s hypothesis, i.e., the spring coefficient of each dumbbell depends only on 
the local average elongation, allows for the introduction of a Gaussian approximation 
for the PDF (FENE-P model [17], where “P” stands for Peterlin), and reduces the 
complexity of the problem to the equation for the covariance matrix TZij, which in 
dimensionless form becomes 



+ T^irKjr -f 

where D/Dt denotes the material time derivative and 

d2 d2 

f(Tl)= ” 






(14) 



(15) 



The conformation tensor is directly linked to the extra stress T-^j caused by the 
dumbbells 



Tfj = (16) 



which provides the back reaction of the polymers on the macroscopic velocity field 
assumed to be governed by the standard incompressible Navier-Stokes equations 
with the additional terms given by the divergence of the extra stress. 

The dimensionless parameters appearing in the previous expressions are the 
following: the Deborah number De, which is the ratio between the polymer relaxation 
time and the fluid dynamic time scale; rjp, which is the ratio between the polymer 
contribution to the viscosity and the viscosity of the solvent; /?max» ''^hich is the 
polymer maximum allowed elongation. Finally, Pe is the Peclet number, i.e., the ratio 
of the convective to the diffusive time scale of the polymers. 
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6.3. Thermodynamic aspects of the polymer solution 

In order to grasp the mechanism of interaction between turbulence and microstruc- 
ture, it is helpful to present a few considerations on the thermodynamic behavior of 
dilute polymer solutions. The basic idea is that the dissipation of the polymers, i.e., 
the friction between the beads and the solvent, principally acts on a purely local basis, 
an aspect captured also by neglecting the diffusivity. In this context the thermody- 
namics of the system may be described in terms of the free energy of the ensemble 
to successively recover the additional contributions coming from the diffusivity. 



6.3.1. Equilibrium distribution and free energy 

The first term on the left-hand side of the simplified kinetic equation (12), after 
rearranging the terms in brackets, becomes 



ir 






- (-)] 

3r V*®/J 

= (t,e± log ^ log , 

where the expression for the equilibrium PDF is 



^eq(r) = ce y = 



3kB® 

2nb^ 



(12) may be rewritten as 

^ - ii. 

dt "" C 9r 



(17) 



(18) 






The term in round brackets is the time-dependent potential related to the Helmholtz 
free energy of the solution (see [12]) 

AP =tipkB@ j V^(r)log^. (20) 

Using the explicit expressions for the two PDFs, where the end-to-end vector is made 
dimensionless with the root-mean-square value at the equilibrium. 






the dimensionless free energy is 

AP = +i^[tr(7^ - 3) - log(det7^)]. 
2 De 

For the FENE-P model the free energy reads (see [2]) 

/?Lv -tr7^^ 



( 22 ) 



AP 



= [<4„ -3)log(^|^) +log(detR)_ . 



(23) 
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6.3.2. Entropy production 

The evolution equation for the free energy gives the relation between the rate of 
change of the free energy, the polymer stress power T^j Eij with Eij the strain tensor, 
and the dissipation due to the polymers. Following the chain rule, the rate of change 
of the free energy is 



DAP dAP DTI, 



V 



Dt 



dUij Dt 



where from (23), 



9^" _ I'Jp 1 3(det7e)l 

3Tlij~ dctn dTlij 



IHLi 

2De^ 



(24) 



(25) 






Hence, neglecting diffusion in equation (14), the material derivative in (24) follows as 



^ - Kii] 



- - 6f(TZu) + 



(26) 



For incompressible flows Ka = 0 and the first square brackets reduces to the 
polymer stress power (16). Finally, the last two terms in (14) can be written as 






r\p 3Tlik 



(27) 



The second square bracket in (26) reduces to 



2 3AP 



(28) 



i.e., 







= GjiGlj = GjiGji, 



(29) 



which, as a positive definite quadratic form, implies that <I> cannot be negative. Con- 
sequently, the stress power 



• E = — — -t- <I> (30) 

Dt 

is expressed as the sum of two contributions, a dissipation term <I>, and the rate of 
change of fi'ee energy accumulated or released. Since the free energy of the polymer 
chains is directly related to their entropy. 



AP = -es, 



(31) 



in isothermal conditions the configurational entropy of the polymers is given by 



DSP 

”dT 
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( 32 ) 
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To recover the effect of diffusion, it is sufficient to retain the corresponding term in 
the derivation of equation (26) to yield 

DAP 

T^E = — + cD + (j>^. (33) 

6.3.3. Energy balance 

The rate of change of the total energy Uj = U q, with U the internal energy and 
q = l/2|up, can be expressed as 



DUt 9uiTij 

p—p^ =PUifi + —^ 

Dt axj 



+ pQ^ 



where Tij is the total stress and Q enforces isothermal conditions. Given the balance 
of mechanical energy 



the equation for the internal energy reads 
DU dui 



Dt ^ dxi 



+ PQ^ 



where the stress tensor is decomposed into a Newtonian part, the pressure p and 
viscous term Oij, and a non-Newtonian part, 7^^ . Since in incompressible flows the 
pressure term is zero, the free energy of the solution is entirely given by (23). Ac- 
counting for isothermal conditions, 

DA DU ^DS 

"d7 “ "d7 “ ^ ^ 

where S is the entropy of the solution, whose rate is given by 

DS 1 3m, <I> Q 






+ i + 

^ ^ © 



In the entropy, two contributions are naturally identified, one associated with the 
polymers (32) and the other given by 



In the above decomposition, the stress power normalized by the temperature cor- 
responds to the entropy exchange between the two components. Typically, when the 
polymers are elongated, their entropy decreases, and consequently, the stress power 
tends to increase the other contribution by a corresponding amount. The previous 
remarks play a crucial role in understanding the interactions between turbulence and 
microstructure. The equation for the kinetic energy q is 

Dq d f 2 ^ 1 






( 40 ) 
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where the term in parentheses can only spatially redistribute energy. The other terms 
are the energy dissipated through molecular viscosity of the Newtonian solvent and 
the stress power. The latter is responsible for the energy drain and supply due to 
the polymers. The polymers introduce an additional dissipation term, and equation 
( 33 ) provides the link between energy exchange, extra dissipation, and polymer free 
energy. This means that kinetic energy may be, at least partially, converted into free 
energy and vice versa and that spatial energy transfer could, in principle, occur via 
convection of polymer molecules, which extract energy from the macroscopic field 
at a certain point and might give it back elsewhere. 



6.4. Polymer dynamics and turbulence 

In this section, a few properties of turbulent flows are recalled to discuss the effect 
of turbulence on polymers. Numerical results are used to discuss the alteration of 
turbulence by polymers. 

6.4.1. The scales of turbulence 

Turbulence is associated with chaotic behavior in both space and time. Often several 
symmetries are present in the system, such as stationarity and homogeneity in one or 
more directions. For example, a turbulent flow in a channel may be assumed to be 
statistically stationary and invariant under translations in the directions parallel to the 
channel walls. The typical turbulent field presents a characteristic scale, the integral 
scale of turbulence L, above which the correlation tensor, defined by 

Rij{x-\o) := {(ui{x,t)uj{\oJ))), (41) 

where the double angular brackets denote an ensemble average with respect to tur- 
bulent realizations, rapidly falls to zero. The correlation tensor may be Fourier trans- 
formed in the directions of homogeneity to yield the energy spectrum. For the channel 
geometry, the spectrum is given by 

E(y, yo; k) := 77 ^ f Rij(x - - xo)d(z - zo), (42) 

{IttY J^2 

where x and z are the two coordinates parallel to the walls and y is normal. The typical 
spectrum exhibits a viscous cutoff above a certain wave number kK that defines the 
smallest characteristic scale of the turbulent field rj = In IkK, called the Kolmogorov 
scale. The presence of the cutoff implies that the kinetic energy content in the Fourier 
modes greater than k^ is negligible. This behavior is well known in Newtonian flows 
at large Reynolds number, where the small viscosity coefficient makes dissipation of 
energy efficient only at scales comparable with the Kolmogorov scale. Actually, the 
Kolmogorov description of turbulence provides a link between the scale r] and the 
local average dissipation rate e 



( 43 ) 
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and predicts the existence of a range of scales much smaller than L and much larger 
than 77 , the inertial subrange, where the direct effect of viscosity is negligible and 
where the turbulence is unaffected by the specific energy injection process. In this 
range, characterized by a self-similar spectrum of the form E(k) a energy 

is transferred at a constant rate from small to large wave numbers. This cascade 
process continuously moves energy from the larger scales, where power injection 
occurs due to flow instabilities or external forcing, towards the smaller scales, where 
dissipation takes place. The physical mechanism behind energy transfer is provided 
by the advection term in the Navier-Stokes equation via stretching and tilting of 
material lines. Actually, in in viscid flows the evolution of vorticity, parallels that 
of material line elements, as implied by the transport equation 

^-^K = 0. (44) 

To grasp the nature of the effects of turbulence on polymers it is instrumental 
to comment on the dynamics described by an equation of the form 

DU T 

— - (K72. + TIYJ) = 0, (45) 

which involves the crucial nonlinear operator appearing in the equation for the con- 
formation tensor, equation (10). According to equation (45), U is frozen in the fluid, 
exactly as for a vector governed by equation (44), since again U is entirely recon- 
structed from its initial condition and the deformation gradient F. 

The analogy implies that the same mechanism of stretching and folding of 
material lines produces a similar cascade process in equations of the form (10). In 
fact both the equations for vorticity and conformation tensor contain additional terms. 
To stop the cascade, the crucial terms are those related to diffusion. As already recalled, 
the process of meandering of vortex lines stops at the level of the Kolmogorov scale. 
The corresponding process on U may instead continue indefinitely, unless diffusion- 
like terms are retained in the equation. The picture is essentially the same found 
in the mixing of strictly nondiffusing tracers: In the stretching and folding process, 
material elements with different concentrations are brought into contact, implying the 
possibility of unboundedly large concentration gradients. Concerning polymers, the 
situation is worse since they react back on the velocity field through the divergence 
of the extra stress, which is directly related to U, equation (16). The process develops 
high wave number modes, and due to the small diffusivity of the macromolecules, 
the physical cutoff is moved to such large wave numbers as to make a well-resolved 
numerical solution unaffordable, unless numerical dissipation is introduced in the 
equations. 

As shown in the next section, the artificial diffusivity approach makes interest- 
ing simulations available for the investigation of polymers in a turbulent environ- 
ment. However, in principle, at very small diffusivities, intense spatial gradients of 
the descriptors of the polymers are produced. This raises the question, addressed in 
Section 6.5, of possible additional microstructural interactions, beyond diffusion and 
migration, between neighboring material elements. 
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6.4.2. The alteration of wall turbulence 

The model described in Section 6.2.1 has been successfully applied to DNS only 
recently [13, 5, 14]. In the channel flow simulation discussed here, turbulence is 
maintained through a pressure gradient that varies in order to provide a constant mass 
flux. Figure 1 compares a portion of the time history of Re^ with the Newtonian 
simulation with the same mass flux. Here Rer = Uxhjv, with Ux = y/Xx^lp, tw being 
the average tangential stress at the wall. A reduction in the average value of Re^ 
from 175 for the Newtonian to 150 for the viscoelastic case is apparent, giving a drag 
reduction of 36%. It is worth recalling here that the chaotic behavior of turbulence 
is always associated with the random local occurrence of certain typical patterns in 
the relevant fields, which are known in the literature as coherent structures (see, e.g., 
[11, 16]). The presence of regions of more or less locally ordered motion is actually 
the origin of the intermittency of the energy dissipation, which is found to occur in 
spots immersed in regions where energy is mainly spatially advected. 
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Figure i. Friction Reynolds number Rer vs. time for the Newto- 
nian (upper plot) and the viscoelastic flow (lower plot). The simu- 
lations have been performed on a domain Inh x2hxl .2nh with a 
computational grid of 96 x 129 x 96 at a nominal Reynolds number 
of 5000. The specific parameters of the viscoelastic simulation are 
De = 10, = 0.1, = 1000, and Pe = 1700. 

In fact intermittency was a most unexpected feature found as soon as anemome- 
ters with sufficient spatial and temporal resolution became available. The coherent 
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structures are particularly significant in shear flows where they provide the main 
source of turbulent kinetic energy, being associated with the relatively large scale 
fluctuations responsible for the Reynolds stresses, hence for energy draining from the 
mean field. Numerical simulations have been able to isolate the elemental phenomena 
underlying birth, evolution, disappearance, and rebirth of the coherent structures, the 
so-called regeneration cycle of wall turbulence. This is done by considering a small 
portion of the flow where only a few coherent patterns can be found. In such con- 
ditions all spatially averaged quantities show characteristic fluctuations that are the 
footprints of the regeneration process. This behavior is recognized in the time history 
of the average wall friction, as expressed by the friction Reynolds number shown in 
Figure 1. There the random fluctuations associated with the chaotic dynamics of the 
wall structures display time scales that are substantially increased by drag-reducing 
(bottom plot) with respect to classical Newtonian cases (top plot) [15]. The dimen- 
sions of the structures involved in the process may be estimated by addressing the 
correlation tensor, equation (41). Significant examples are reported in Figure 2, where, 
with respect to Newtonian flows, the transverse correlation lengths are substantially 
increased [18]. 




Figure 2. Spanwise correlation at := u^y/v = 10 of the 
velocity components for the viscoelastic flow: Ruu (solid line), Ryy 
(dashed line), R^w (dotted line) vs. z'^ = u^z/v. 

Qualitatively, Figure 3 offers the explanation of this effect by showing a well- 
organized pattern in the streamwise velocity field near the wall. This substantial level 
of organization is inherited by the viscoelastic reaction, the divergence of the extra 
stress, equation (15). The organization of the polymers passes through a process of 
time scale selection, with the polymers most affected by fluctuations at frequencies 
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comparable with their natural time scale, defined as the principal relaxation time. It is 
worth emphasizing that these scales, belonging to the intermediate hydrodynamical 
frequency range, carry a dynamics that is largely affected by convection, implying 
that the correct framework for physical interpretation is the Lagrangian one. The 
DNS also makes available all the thermodynamic quantities discussed in Section 6.3, 
namely, the free energy, the dissipation, and the polymer stress powers, as well as the 
different contribution to the entropy. As an example, in Figure 4, the rate of change of 
the polymer free energy is superimposed to the isolines of the streamwise perturbation 
velocity. From the figure, the correlation between the elementary regeneration events 
and fi*ee energy accumulation are apparent. 





Figure 3. Plane section parallel to the walls at y~^ = 30. Top: 
Contour lines of the streamwise component of perturbation veloc- 
ity. Bottom: Streamwise component of the viscoelastic reaction, 
defined as the divergence of the extra stress. 
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z 

Figure 4. Cross-flow section: Isocontours of the streamwise com- 
ponent of perturbation velocity and of the rate of change of the 
polymer free energy, lines, and flood, respectively. 



As a summary, the numerical results obtained by the FENE-P model with diffu- 
sion show the capability of the model to shed light on the rather involved dynamics of 
polymers in a turbulent environment. A point remains open to discussion concerning 
the possible existence of mechanisms of interaction of nearby populations of poly- 
mers at small diffusivities, a topic addressed from a general perspective in the next 
section. 



6.5. A multifield description of substructural interaction 

The microrheological models presented in the previous sections describe the statis- 
tical behavior of the polymers limiting possible interactions between neighboring 
elements to diffusion or migration effects. As already mentioned at the end of Sec- 
tion 6.2.1, diffusion terms are crucial to control the ultraviolet instability that may 
take place in a turbulent environment. A general investigation of the possible cross- 
element interactions is thus worthwhile. However, the adopted point of view requires 
a clear knowledge of the microdynamics, which may be lacking in more general 
conditions. To overcome this difficulty it is useful to adopt a general framework to 
generate appropriate balance equations for standard and substructural interactions, in- 
dependently of any special microrheological mechanism. Specific phenomenological 
constitutive equations may then account for the different substructural mechanisms 
to be considered. In any case, such a general framework should recover as special 
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case the equation obtained from specific microrheological models able to fit experi- 
mental data for polymeric liquids. To this purpose the appropriate setting is given by 
the multifield theories introduced to study continua with microstructure [3, 9], where 
proper descriptors are used to derive the balance equations for the interactions. Here, 
conforming to the previous approach, the conformation tensor % is selected as the 
appropriate descriptor for the population of the polymeric chains. The requirements of 
invariance with respect to changes of observers are then applied to obtain the balances 
of standard and substructural interactions. 



6.5.1. Placements and descriptors 

The regular region of the three-dimensional Euclidean point space occupied by the 
fluid in its reference configuration is denoted by So- Points of So are indicated with 
X. The standard deformation of So is described by orientation-preserving diffeomor- 
phisms x(-)-^o The current configuration x(So) = S and is regular, too. Any 

regular subset b of S is called part of the fluid in the following. The picture of the 
configuration of the fluid just summarized is not sufficient because no geometrical 
information is given on the substructure generated by the polymeric chains. Here each 
material element P placed at X in the reference configuration is endowed with a pop- 
ulation of chains. Problems related to possible entanglement of polymeric chains are 
presently excluded. Following the microrheological approach, each chain is described 
by an end-to-end stretchable vector r and at each point X of Bo a distribution function 
^P(X) (r) represents the population of the chains within the material element P placed 
at X. The requirement of indifference to the transformation r — r suggests that the 
minimal complexity in the description is achieved by using the second-order tensor 

7^*(X)=/' Vfp(X)(r)r®r (46) 

to provide a rough coarse-grained geometrical descriptor of the population itself (see 
(7)). TZ^ is used as the order parameter in the following, where the star denotes the 
material representation of the field. 

The descriptor TZ^ of the substructure is also associated with each material 
element. A spatial representation TZ of V? is clearly given by 7?. = 7?,(x(X)). The 
current placement and the value of the order parameter at the instant t of the material 
element at X are given by x(X, t) and 7?.^(X, t). Rates in the material representation 
are given by x(X, t) and 7t^(X, t) while in the spatial representation they are given 
by u(x, t) and 7t(x, t) = DTZ(x, t)/Dt. Clearly, x = u and by the chain rule TZ = 
dtTZ-}- (yTZ)u. 

As a crucial point of the modeling in the present context, the order parameter 
is considered an observable object; i.e., it should sense a change of observer. Two 
external spatial observers (e.g., O and O'), seeing the same configuration, evaluate 
two different pairs of rates. The relevant rules of change of observers related by 
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rigid-body motions, i.e., by proper orthogonal tensors Q, read 

u* = u -f c(0 + q{t) X (x - xo), (47) 

n*=n-^Aq. (48) 

In (47), c(0 is the translational velocity, q the rotational velocity (i.e., the characteristic 
vector of Q^Q), and xo a fixed point in space, while in (3) the linear operator A is 
a third-order tensor whose covariant components are given by Aijk = ^UjT^ik — 
where e is Ricci’s tensor (see, e.g., [9] for a complete description of changes 
of observer). 

6.5.2. Microstresses and balance equations 

Since the order parameter is an observable object, the interactions developing power 
in the rate of change of the order parameter have to be accounted for. In the following 
only contact interactions exchanged by neighboring material elements are considered 
as a consequence of the rearrangement of the polymeric chains in each element. These 
mechanisms, though as yet unidentified in microrheological terms, are to be thought 
of as additional to the diffusion discussed in Section 6.1, which is not readdressed 
here under the multifield viewpoint. For the sake of simplicity bulk interactions on 
the polymeric chains are not considered. These bulk interactions could have a twofold 
nature: (i) they could account for an inertial contribution of the agitation of the chains 
on the macroscopic motion; (ii) they could be of electrical nature in the case of 
polyelectrolyte polymers (see, e.g., [8]). However, the inertial contribution of the 
chains is basically negligible and the case of applied external electric fields are not 
considered. 

Given any arbitrary part b of B, is the power developed on it by all the 
external standard and substructural interactions as a linear functional on the space 
of the rates. In this way the general framework of multifield theories describing 
material substructures is followed as developed in [3, 4, 9] and the substructural 
interactions between neighboring polymeric chains is represented accordingly. With 
these premises, the expression of is given by 

Vl^\u,TZ)= f b-u+ f (Tn.u + 5n-7t), (49) 

Jb Jdb 

where b is the vector of standard bulk forces, T the Cauchy stress, n the outward unit 
normal to the boundary 9b of b, and <S is a third-order tensor called the microstress. 
It represents the interactions exchanged by neighboring material elements across the 
surface 9 b as a consequence of the rearrangement of the polymeric chains in at least 
one of them. 

The basic requirement is then 

is invariant under changes of observers ruled by (47) and (48)/or any 
b, i.e., 

TZ*) = h) 

for any choice of c{t), q(r), and b. 



(50) 
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As a consequence of the requirement of invariance, the equation 



c • 



+ q * f f (X - Xo) X b + / ((x - Xo) X Tn + A^Sn) 

\Jb Jdh 



= 0 



( 51 ) 



follows for any choice of c(t) and q(f). The product A^S is a second-order tensor 
whose CO variant components are given by AfijSijk- Thanks to the arbitrariness of 
c(0 and q(0, the validity of the integral balances of momentum and moment of 
momentum are, respectively, obtained as 

fh+f Tn = 0, (52) 

Jh Jdh 

/ (x “ Xo) X b + f ((x ~ Xo) X Tn + A^Sn) = 0. (53) 

Jb Jdb 

The arbitrariness of b allows one to obtain the standard pointwise balance of momen- 
tum 



b-hdivT = 0, (54) 

which can be clarified further when the vector of the bulk forces is considered as 
the sum of an inertial and a noninertial contribution that (49) may be written in the 
standard way as 

b"^ + divT = pii. (55) 

More delicate is the treatment of the integral balance of moment of momentum. 
Thanks to the arbitrariness of b and the validity of (53), the pointwise balance 
reads 

eT = div 5 + (V>1^)5, (56) 

where, as in the previous sections, V is the gradient with respect to a:. To assure the 
validity of (56), the existence of a second-order tensor such that 

= eT - iVA^)S (57) 

is necessary. This allows one to write (56) as 

div 5 - z' = z" (58) 

with z" an arbitrary second-order tensor such that 

A^z" = 0. (59) 

The sum t1 + z" is indicated by z and is called the self -force in the current jargon of 
multifield theories [3, 4, 9]. As the microstress represents the interactions between 
neighboring material elements, the self-force accounts for the interactions between 
chains in the same material element. In other words, in the absence of bulk motion, to 
maintain at the (dynamical) equilibrium the population of polymeric chains in a given 
material element, it is necessary that the interactions exchanged between elements of 




6. Microstructure and Turbulence in Dilute Polymer Solutions 



145 



the population at the given element are balanced by the interactions exchanged with 
the populations of the neighboring elements. 

6.5.3. The dissipation inequality and the evolution equation for the descriptor 

A mechanical dissipation inequality (an appropriate isothermal version of the second 
law) is here used to obtain constitutive restrictions relating the measures of interaction 
listed above to the free energy of the fluid. It is consistent with the developments in 
Section 6.3: Basically, it is a generalization of (26) written in isothermal conditions. 
The inequality states that for any part b and for any choice of the rates and their 
gradients 

^ f A-Vl^\u,n)<0, (60) 

at 

where A is the total free energy density per unit volume (i.e., A accounts for A^ and 
the bulk free energy of the fluid flow). The free energy is assumed to have a state 
dependence of the type 

A = A(F, 7^, V7^). (61) 

Taking into account the arbitrariness of the part b and the validity of the bal- 
ance equations (55) and (58), and developing the time derivative of A from (61) at 
thermodynamic equilibrium, equation (60) reduces to 

(3fAF^ - T - {VnfS) • K + (^V7^A -S)-in 

(o2) 

-h {d'jiA — Z) • < 0, 

where dy represents the partial derivative with respect to the argument y and, in 

developing calculations, the identities F = FK and V7Z = VTZ-\-(VTZ)K were used. 
Because the inequality (62) is linear in their rates and gradients and must hold for any 
choice of them, the following identities must hold: 

T = dpA¥^ -{VTlfS, 5 = 3vtcA, z = dnA. (63) 

However, the free energy must remain unchanged under the action of the group of 
material symmetries. In the case of a fluid this group coincides with the unimodular 
group of all second-order tensors H with jdetHI = 1. Consequently, (61) must 
remain unchanged with respect to the transformation F ^ FH. However, as the 
tensor H = |detF|F~^ is obviously unimodular, (61) reduces to 

A(F, 7^, V7^) = A(t, n, WTZ), (64) 

where i = |det F[ is the specific volume of the fluid. Then the constitutive restrictions 
(63) reduce to 



T = i3,A(i, TZ, V7^)I - (V7^)^^v7^A(t, TZ, VIZ), 
<5 = 9v7?.A(t, TZ, VTZ), 
z = d-jiA^i, TZ, VTZ). 



(65) 

( 66 ) 
(67) 
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Note that the simplest expression of (6 1 ) is a decomposed free energy of the form 
A(i, Tl, VTl) = Ai (i) + A2(TI) + ^a\Vnf (68) 

with a some constant. In this case, the balance of substructural interactions (61) 
reduces to a Ginzburg-Landau equation for 

a ATI- SnM(T^) = 0. (69) 



When a partially coupled form like A(l,TZ, VTZ) = As( 4, 7?.) + is allowed, 

the expression 



aATZ — 7?.) = 0 



(70) 



is obtained. Far from thermodynamic equilibrium, some measures of interaction are 
the sum of a rate dependent nonequilibrium part (ne) and a rate-independent equilib- 
rium part (eq). In particular, the decompositions 

T = T""(i, 7^, V7^; K, TZ)-i-T\L, 7^, V7^), (71) 

z = 7^, V7^) -f z"®(^ 7^, V7e; K, Tl) (72) 



are considered, where and are given by (65) and (67). By inserting these 
decompositions into (62), the reduced dissipation inequality 

Tne.K-fz"®.7t<0 (73) 



is obtained. 

6.5.4. A revised form of the evolution equation for the conformation tensor 

Physical instances suggest the transformation of the local dissipation inequality (73) 
into 



T"® • K + z"® • < 0, (74) 

V 

where TZ is the upper-convected derivative of the tensor TZ following the velocity 

field u. In other words, TZ = TZ — KTZ — TZK^ . 

Since the self-force represents the self-interaction of each polymeric chain on 
itself and the interactions between the elements of the population of a given material 
element, z^® does not develop power in the collective motion of the population due 
to the transport of the fluid. Rather, z”® develops power in the relative motion of 
the chains with respect to the fluid and one another. This is the basic reason of the 

V 

substitution of the term z^® • TZ with z"® • TZ and the corresponding substitution of 

z^®(i, TZ,VTZ\K,TZ) with z”®(4, TZ,VTZ;K,TZ). Reasoning of this kind is consistent 
with relation (1), which indicates the relative motion of the polymeric chain with 
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respect to the fluid as a source of dissipative behavior. A solution of the inequality 
(74) is 



T^"=BiK + B 2:^, (75) 

z”" = B 2K + B3:^ (76) 

with Bi , B2, and B3 appropriate definite negative constant fourth-order tensors. When 
B2 vanishes identically and Bi and B3 reduce to scalars, equation (70) may be changed 
into the time-dependent Ginzburg-Landau equation 

B^TZ = oiATZ — djiA^^L^TZ), (77) 

and the balance of momentum reduces to the standard Navier-Stokes equation. The 
evolution equation (14) is a special case of (77), which is also able to capture more 
complicated situations due to the coupling in A3 between l and IZ. The term aATZ 
is a natural consequence of a rather detailed picture of substructural interactions and 
accounts for weakly nonlocal interactions among the polymeric chains. However, it 
should be emphasized that its origin is different and complementary to the diffusion 
of macromolecules as discussed in Section 6.1 . An extention of the present multifield 
approach to account for diffusion can be developed following the same guidelines 
but including a continuity equation for the density number of the polymeric chains. 



6.6. Concluding remarks and perspectives 

The results obtained by the DNS for a dilute polymer solution using a FENE-P model 
have been investigated to study the interaction of the microstructure with turbulence. 
The instantaneous fields show that the viscoelastic reaction is organized on the same 
scale of the structures responsible for the velocity fluctuations. Large positive values 
of the stress power are observed in correspondence with intense instability events in 
the wall layer. 

The role of diffusion in stabilizing the ultraviolet instability of polymer dy- 
namics in a turbulent environment has been discussed at length. The capability of 
the multifield approach to model from a general viewpoint, complex microstructural 
dynamics has been exploited to explore weakly nonlocal interactions between poly- 
mer populations of neighboring elements even when diffusion effects are negligible. 
Though not conclusive, some indication is found that dissipative interactions different 
from pure diffusion may play a role in turbulent polymer solutions. 
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7.1. Introduction 

The work presented here is concerned with stress theory on differentiable manifolds. 
In addition to the theoretical interest as to the geometrical structure needed for the 
formulation of stress theory, such a general geometrical setting is used in theories of 
materials with microstructure to model the space where the “order parameters” are 
valued (see [2]). This sequel to our previous work on this subject (see [5, 6, 7, 8, 9]) 
follows [11] and focuses on two aspects of stress theory: 

(a) the stresses are associated with bodies whose material structure is induced 
by an extensive property rather than assumed a priori; 

(b) considering the electric charge as the extensive property, the general setting 
allows us to study the Maxwell stress tensor of electromagnetism for the 
case where a metric structure is not available on spacetime and without any 
reference to a constitutive relation for the electromagnetic fields (e.g., the 
aether relations). 

Naturally, the theory is formulated on spacetime, taken to be an m-dimensional 
differentiable manifold ^ . Spacetime is assumed to be orientable and a specific 
orientation is assumed to be chosen. It is noted that we do not use a connection in the 
analysis. 

Before the presentation of these aspects in the main body of the paper, we 
present in Section 7.2 the motivation for the formulation of continuum mechanics 
on manifolds using differential forms even when microstructure is not present. The 
presentation in Section 7.2 is elementary and it is hoped that it will convey the main 
ideas to a wide readership. 

Following the short introduction of notation in Section 7.3, Section 7.4 describes 
the way an extensive property induces a material structure (see [10]). Following 
[3], the point of view adopted is that the material points may be defined as integral 
manifolds of the flux vector field. Here, we do not assume the existence of a velocity 
vector field but obtain the flux field for the property under consideration from Cauchy’s 
theorem for scalars. Furthermore, the general geometrical structure assumed implies 
that the flux field is an (m — l)-form on the m-dimensional manifold rather than a 
vector field. Nevertheless, we show that the relevant aspects of the material structure 

*Department of Mechanical Engineering, Ben-Gurion University, Beer-Sheva, Israel, rsegev© 
bgumail .bgu.ac. il. 
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may be defined. It is noted that in a spacetime formulation the flux object contains 
information regarding the density of the property. 

Section 7.5 gives a short description of stress theory on manifolds (see [7, 8, 
9]) and Section 7.6 combines it with Section 7.3 by considering stress theory for 
bodies whose material structure is induced by an extensive property. The resulting 
stress object is a tensor that transforms the flux (m — l)-form of the property under 
consideration into the energy flux. Section 7.7 discusses the representation of such 
stresses and it is shown that there is a simple isomorphism between tensors on the space 
of (m — l)-forms and tensors on the tangent spaces. Thus, the stress-energy tensor 
may indeed be represented by a linear mapping of vectors as in the classical case. 

Finally, we apply the foregoing framework to electromagnetism in Section 7.8. 
The property under consideration is the electric charge. While traditional formulations 
of electromagnetism rely on a metric structure, our formulation of the Maxwell stress 
tensor uses only a volume element. We conclude by giving a generalized equation for 
the Lorentz force. 



7.2. Motivation 

Consider an extensive property that we denote by p such as the internal energy or the 
electric charge. One of the basic notions of continuum mechanics is that of the flux 
vector j associated with the property. Thus, for the internal energy property we have 
the heat flux vector field and for the electric charge we have the current density, etc. 
The flux vector field allows one to compute the rate at which the property p leaves 
any region IZ through its boundary as 

/ j nt/a, 

JdU 

where n is the unit normal to the boundary dK pointing out of the region. So far, 
we have not indicated whether the region TZ is the image of the current configuration 
of a body B in space ^ or a certain reference configuration. Thus, we will reserve 
the present notation for the case where values are measured with respect to current 
configuration k in space. For the variables relative to the reference configuration we 
will use the corresponding uppercase letters. So, for example, N is the unit vector 
normal to the boundary at the reference configuration. If k is the deformation of 
the body from its reference configuration then the corresponding integral over the 
reference configuration is 

f J-NdA. 

Traditionally, J is referred to as the Piola-Kirchhoff flux vector field and j is 
the Cauchy flux. The relation between the Cauchy flux and the Piola-Kirchhoff flux 
is obtained by the requirement that the two integrals should be equal on any subset 
of the boundary 3TZ so that with some abuse of notation we may write 

J • N JA = j • n da. 
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Consider an arbitrary infinitesimal vector U at the reference configuration, the image 
of which at the configuration /c is u = F(U), where F = Dk is the deformation 
gradient. Then the volumes 

dV A 



of the infinitesimal parallelepipeds generated by U, dA and u, da, respectively, are 
related by 

dv = \F\dV, 

where |F1 denotes the determinant of the deformation gradient. Hence, 
U-NIFMA =u-n^/a 

= F(U)-nJa 

= U- F^(n) da, 

and the fact the U is arbitrary implies that 

|F|NdA = F^(n)</a. 

One then writes 



j • n = J • N JA 

J • F^(n) da 

_ F(J) • n da 

" W\ ■ 

Since the last equality has to hold for any region so that nda is arbitrary, one 
finally obtains the traditional relation between the two forms of the fiux, i.e., 

J = |F|F-'(j)- 

The point of view we want to emphasize here is that both Piola-Kirchhoff’s and 
Cauchy’s flux vector fields actually represent the same physical quantity, the flux of the 
property p inside the body. The current configuration, reference configuration, and any 
other configuration merely assign different coordinate systems on the body. Hence, a 
single mathematical object should represent the flux. That mathematical object may be 
represented by the Cauchy flux vector field with respect to the current configuration 
or the Piola-Kirchhoff flux with respect to the reference configuration just like a 
vector is represented by different sets of components when one considers two bases 
for a vector space. Thus, for the mathematical flux object, the relation between the 
Cauchy and Piola-Kirchhoff vector fields will be a transformation rule between the 
two representations. The relation between the Cauchy and Piola-Kirchhoff fluxes 
clearly indicates that the flux object is not a vector field (since the transformation 
does not involve only the application of F~^). 
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7.2.1. The flux density form on the boundary of a body 

The basic mathematical objects we can handle are scalars that remain invariant under 
a deformation, vectors that transform using the deformation gradient, and the various 
linear mappings whose transformation rules are induced by the transformation rule for 
vectors. We note that the area of a material subset of the boundary of a body is not an 
invariant scalar. In addition, the unit normals n and N are not related as vectors. Hence, 
we do not have an invariant unit normal associated with a body that is deformation 
independent. Consider a fixed body Ti. and a material point X on its boundary. The 
tangent plane to the boundary will be denoted by Tx 97?. and vectors in it are conceived 
as pointing from X to infinitesimally close neighboring points. Note that if we do not 
want to represent the vectors by components, the particular configuration of the body is 
immaterial . For this reason, we may use u , for example, to denote such a tangent vector 
and its representations in the current and reference configurations are the previous u 
and U, respectively. Consider an infinitesimal triangle on the boundary whose edges 
are two tangent vectors u and v. We will denote by t(m, v) the infinitesimal flux of the 
property through the parallelogram formed by the two vectors, i.e., the flux through 
the triangle multiplied by 2. If we study r(w, u) as the vectors u and v vary, it is 
natural to assume that for such an infinitesimal triangle the value depends linearly on 
both arguments. In other words r(-, •) is bilinear and we have 

r(aiui + ^2M2» + hv2) 

= aibir(uu vi) + aib 2 r(uu V 2 ) + fl2^ir(w2, i^l) + a 2 b 2 x{u 2 , U 2 ). 

In addition to being bilinear we expect r(w, u) to vanish if the edges of the 
triangle are situated along one line. In other words, x(u,v) = 0 if the two vectors are 
not linearly independent, and in particular, t(m, w) = 0 for any vector u. These two 
requirements imply that for any two vectors 

t(m, v) 4- t(u, u) = t(m, u) 4- t(m, v) 4- r(u, u) -f r(u, v) 

= x{u 4- u, M 4- u) 

= 0 . 

Hence,r(w, v) = —r(i;, M)andweconcludethatasatensor,r is antisymmetric. Thus, 
we expect that the flux density on the boundary is represented by an antisymmetric 
2-tensor, a 2-differential form on the boundary dTZ, Note that since the tangent space 
is two dimensional, the dimension of the space of antisymmetric 2-tensors is one. It 
follows that it is enough to know the value of the flux for the triangle constructed by 
any two vectors to determine the value of r at X. 

7.2.2. Orientation 

The antisymmetry of r causes a complication. We have to decide which of the argu- 
ments of X comes first. On the other hand, this was expected, since we did not indicate 
what direction of flux across the surface element we consider to be positive. Pick any 
three linearly independent vectors ui , i; 2 , V 3 and declare every other triplet of linearly 
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Figure i . Linearity, motivation. 



independent vectors , W 2 , W 3 to be positively oriented if the determinant of the ma- 
trix representing them as linear combinations of , U 2 , 1^3 has a positive determinant. 
Since the determinant of a matrix is an invariant quantity, the orientation of a triplet 
of vectors is well defined. The orientation of the body induces an orientation on the 
boundary, i.e., a way to decide in which order two tangent vectors should appear as 
arguments of t. If m at X on the boundary is any vector pointing outwards, we say that 
the pair of tangent vectors u, u; is positively oriented on the boundary if the triplet 
u,v,w is positively oriented in the body. Hence, only positively oriented (or ordered) 
pairs of vectors may be used in r if we want to evaluate the flux out of the body. 

7.2.3. The flux form 

Let X be a point in the body. The flux object 7 at X should assign the flux rate 7 (m, i>) 
through an arbitrary infinitesimal triangle defined in terms of some two vectors m, i; at 
X. That is, unlike the flux density form, we do not know a priori the plane where the 
vectors are situated. We expect that in case the vectors u and t; are on a tangent plane 
for which the flux density form is r then 7(w, i>) = r (m, u) for consistency. Hence in 
particular 7(w, v) = — 7(u, w), J(au, v) = aJ{u, v) and 7(i;, v) = 0. Consider the 
infinitesimal tetrahedron X, A, fi, C having a vertex at X and generated by the three 
vectors u,v,w originating at X) (see the tetrahedron with solid lines in Figure 1). 
Assume we choose a right-hand orientation in space so that, for example, the pair of 
vectors w, i; on the XAB face is negatively oriented. Since the tetrahedron’s volume 
tends to zero faster than the total area of its boundary in any configuration of the body, 
the balance of the property p implies that the total flux should vanish. Hence, taking 
into account the orientations of the various faces, we have 

7(u, u) -f 7(u, w) + 7(m, i; -h u;) — 7(w -f u, u;) = 0. 
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If we carry out the analogous calculations for the tetrahedrons X, B,C, E 
(dashed lines) and X,C,D,E (dotted lines) we obtain the additional equations 

7(m, w) + J{u + u, u;) + 7(u, u) — J{v, u; + w) = 0, 

J{w, u) — J{v + u;, w) — 7(u, w) + J{v, w -j- u) = 0. 

When we add the three equations we obtain the balance of flux for the prism ABC D EX 
and using the antisymmetry of J we arrive at 

7(m, V w) = J(u, v) + J(u, w). 

Hence, it follows that the flux object J is an antisymmetric bilinear tensor on the 
three-dimensional vector space of vectors at X. Since the dimension of the space of 
antisymmetric bilinear tensors is three, it is clear that J is uniquely determined by 
the flux densities pertaining to three independent planes. 



7.2.4. Stresses 

We now want to use the flux object in order to define the material stress object. Our 
starting point is the expression for the virtual power 



L 



b-udv-\- I t'ji-uda= I 

n JdK H 



b • u -f / T(n) • u da 

JdIZ 

= / hudv+ I T^(u)-nda, 
Jn Jdn 



Jn Jdn 

where u denotes the virtual velocity field, b is the body force field, and t'ji is the 
surface force field that depends on the region IZ under consideration through the 
Cauchy formula t'ji = T(n) using the Cauchy stress T. We regard the last integrand 
as the flux of power through the boundary dTZ so that the foregoing discussion holds 
for the energy extensive property. Hence, the corresponding Cauchy flux vector field 
is = T^(u), which depends linearly on u. We denote by the corresponding 
flux density so that 

w) = nda = T^(u) • n da 



is the flux of power through the infinitesimal surface element da associated with the 
vectors v and w. Because depends on the spatial velocity field u linearly, this 
property is inherited by the flux form In other words, one has to show that if 
u is the invariant velocity vector represented by u, then the dependence of on 
u is linear. If we write w) to emphasize the dependence on the particular 

velocity u so w) = T^(u) -n da, then the linear relation between u and u 

implies that 

-h b2Ui){v, w) = T^(b\u\ 4 - i?2U2) • n da 

= Z?iT^(ui) -nda-\- b2T^{n2) -nda 
= b\J^^\ui){v, w) + b2J^^\u2)(v, w), 



which yields the required linearity, since the vectors v and w are arbitrary. In other 
words, there is a linear mapping a at X such that = cr(u). This mapping is a 
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generalization of the stress tensor and it is a mapping that takes the virtual velocity 
at X and gives the power flux. 



7.2.5. Conclusions 

We conclude from the above introductory remarks that multilinear mappings, par- 
ticularly the completely antisymmetric or alternating ones, play a crucial role in the 
analysis of fluxes and stresses. 



7.3. Notation 

The language used for the formulation is that of differential forms on manifolds 
and their integration (see [1]). We recall that spacetime ^ is assumed to be an m- 
dimensional orientable manifold with a definite orientation chosen. An m-dimensional 
compact submanifold with boundary 7?. of ^ will be referred to as a control region. A 
frame is a diffeomorphism of spacetime onto the Cartesian product R x where R 
represents the time axis and represents the space manifold — a “slice” of spacetime. 

Although our treatment does not use a frame, we will sometimes describe the physical 
meaning of variables in the setting of a frame. Thus, if a frame is given, a control 
region in spacetime may be conceived as a time- varying control region in space. The 
tangent space to at jc will be denoted by Tx^ and the tangent bundle to ^ will 
be denoted by T^. A typical coordinate system in a neighborhood of x e ^ will 
be denoted by {jc^, . . . The induced local basis for the tangent spaces will be 
denoted by {d/dx ^ }, and the basis for the dual space will be denoted by [dx ^ }. 

Denote by the vector space of r-alternating multilinear forms on the 

tangent space Tx^. The bundle of r-forms is denoted by f\^ {T*^) so that an r- 
differential form O' is a section O' : ^ -> An m -form yo may be interpreted 

intuitively as a density of an extensive property such that for a: € ^ , Pjc (w i , . . . , ) 

is interpreted as the amount of the property in the infinitesimal element generated by 
the positively oriented collection of tangent vectors {wi, . . . ,w^} at a:. Such a form 
may be integrated over a control region to give p. We recall that the dimension 
of is one. An m-form that does not vanish anywhere on is a volume 

element. The existence of a volume element is equivalent to orientability of ^ and a 
choice of a volume element determines an orientation on 

An (m — l)-form a> may be integrated over (m — 1) -dimensional oriented sub- 
manifolds of in particular, over boundaries of control regions. The orientation of 
the boundary of a region is induced by that of ^ and the outward-pointing vectors. 
We recall that the dimension of is m and cox may be evaluated on the 

various collections of m — 1 tangent vectors {u\, . . . ,Wm-i} situated on the various 
(m — 1) -dimensional subspaces ofTx^, or hyperplanes, at x. Hence, the evaluation 
of (o for m collections of m — 1 vectors such that the various collections are situated 
on linearly independent hyperplanes determines co uniquely. 
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We denote the exterior derivative of an (m — l)-form co as dco. The m-form dco 
satisfies Stokes’s theorem, i.e., for each control region TZ, we have 

f 1 * 0 ) = f dco, 

Jdn Jn 

where l : dTZ ^ is the natural inclusion and l* is the pull-back of forms it induces 
(i.e., its adjoint) so that 

for Ml Mm-1 e TxdTl. 



7.4. Material structures and extensive properties 

Consider the conservation of an extensive property p in spacetime The balance 
of the property p is assumed to be given in terms of boundary integrals on the various 
control regions. Specifically, it is assumed that for each control region TZ there is 
an (m — l)-form X'ji on dTZ, the flux density. The integral rqi is interpreted as 
the flux of the property out of the control region in spacetime relative to the positive 
orientation induced on dTZ by the orientation on ^ and the outward-pointing vectors.^ 
Regarding X'ji as the value of a set function defined on the collection of con- 
trol regions, Cauchy’s postulates of continuum mechanics can be generalized to the 
present setting (see [6], [8] for the details) to yield an analogous generalization of 
Cauchy’s theorem. The traditional assumption of dependence on the normal to the 
boundary is replaced by dependence on the tangent space to the boundary. The theo- 
rem states that there is a unique (m — l)-form 7 on to which we will refer as the 
flux form associated with the property p, that induces the flux densities for the various 
subbodies. Thus, in the absence of a metric tensor, the value of the flux at a point 
is an (m — l)-altemating tensor rather then a vector. It is noted that both objects are 
elements of m-dimensional vector spaces. The flux form J induces flux densities X'ji 
on the boundaries of the various control regions by the analog of the Cauchy formula 

TTZ{U\,. = J(Uu...,Um-\)- 

Equivalently, we may write 



rn = 

In other words, the flux form is restricted to a hyperplane through x to give the flux 
density associated with it. As expected, one can determine the flux form uniquely by 
knowing the flux densities corresponding to m independent hyperplanes. 



^ In case a frame is given, the flux density through a space-like slice is interpreted as the density of the 
property p in space. The flux through a hyperplane containing d/dt, the natural tangent vector to the time 
axis, is interpreted as the classical flux density of p into the corresponding region in space y. 
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Usually, it is assumed that there is a source density term s for the property, an 
m-form on so that the conservation equation of the property is 




for every control region TZ. In this case, Stokes’s theorem and the generalized version 
of Cauchy’s formula imply that 




for every TZ. Thus, the conservation equation may be written in a differential form as 
dJ = s? 

We now consider the particular case where a volume element 9 is given on ^ . 
In physical terms such a volume element may be interpreted as a positive extensive 
property that fills spacetime, e.g., mass density. We recall that, given the m-form 9x 
and a vector field w, the (m — l)-form w j the contraction of 9 with m, is defined by 



= 0(u, Wl, . , . 



for every collection of m — 1 vectors. For a given volume element, at each point this 
relation is an isomorphism Tx^ ^ ^se 

m-l 

is: /\{T*^)->Tx^ 

to denote its inverse. Thus, if m = ieiJ), then uj9 = J, Let 9 be represented 
locally by 

r(x^)dx^A- • • Adx^, 

where a “wedge” ( a ) denotes the exterior product sodx^A.,. Adx^ is a single basis 
element of Then ie{J) is represented by 



r 

If / is a flux form of an extensive property p and a volume element is given, we 
will refer to m = ie{J) as the kinematic flux associated with p. The kinematic flux is 
the analog of the 4- velocity field. In other words, in order that the flux is represented 
by a vector field, one needs the additional structure of a volume element. If ^ denotes 



^Again, if a frame is given on spacetime, then the time component of J (that involving dx^ A • • • A 
where x^ is the time coordinate) is interpreted as the density in space of the property p and the term in 
dJ containing it is the time derivative of that density. The space-like components of J (those associated 
with the basis vectors dx^ A • • •) describe the three-dimensional flux and the terms 'mdJ involving the 
space-like components make its (m — l)-dimensional divergence. In a particulM" frame, for every time t, 
the classical conservation law has the integral form 




L 






where here 7Z is interpreted now as a region in space (a slice of spacetime) and P'ji is the rate of change 
of the density of the property — a 3-form. In order that the previous Cauchy assumptions apply, it is usually 
assumed that p'ji is actually independent of TZ. 
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the Lie derivative, then the differential balance equation can now be written in the 
form-^M^ = s. 

A flux form J induces at each point jc e ^ a one-dimensional subspace, the 
flux space, Ex as follows. The flux space is the intersection of all hyperplanes for 
which the induced flux densities vanish. A tangent vector u belongs to the flux space 
if and only if ujJ =0. This is analogous to the velocity vector at a point. The flux 
density will vanish through any hyperplane containing it. Alternatively, the flux space 
is the one-dimensional vector space obtained by the relation u = ie (J) when the flux 
form J is kept fixed and the volume element 9 is allowed to vary. The collection E 
of the subspaces Ex for the various elements of spacetime will be referred to as the 
flux bundle. 

Since the flux bundle is one dimensional, for each x e ^ there is a unique 
one-dimensional submanifold, the integral submanifold 7 of ^ so that Ex = TxY 
(see [1]). We will refer to 7 as a worldline. Thus, even in the general case where the 
flux cannot be represented by a vector field, its integral submanifolds and worldlines 
may be defined. Following the ideas of Capriz & Trebeschi [3] we can now identify a 
body point with a worldline, a material body with a collection of worldlines, and the 
material universe with the collection of all worldlines (see [10] for further details). 

It is noted that within this general framework, one can easily define the flow 
potential. If the source s vanishes, then the flux satisfies the equation ^7 = 0. Hence, 
at least locally, there is an (m — 2) -form the flow potential, such that J = dxjr. 

7.5. Stress theory for manifolds 

We consider a /x-dimensional vector bundle n : W ^ over ^ . The vector bundle 
is interpreted as the bundle of generalized velocities over ^ For the case of a general 
differentiable manifold one cannot model forces by vectors and integrate them. Forces 
should be defined in terms of the power they produce for virtual velocities. Hence, 
the stress theory for manifolds, presented in [7, 8], considers for each compact m- 
dimensional submanifold with boundary 7^ of ^ a linear functional of the generalized 
velocity fields containing a volume term and a boundary term of the form 

Fn(w) = f + f t7j(w). 

JtZ JdTl 

Here, u; is a section of W interpreted as a virtual velocity field, h'ji, the body 
force, is a section of L{W, /\^(T*TZ)), and t'ji, the boundary force, is a section of 
L(W, (7*372.)) so the integrals make sense. The functional F'ji is interpreted 
as the force, or power, functional and the value F'ji(w) is classically interpreted as 

^In classical continuum mechanics, if ^ is interpreted as the physical space (a slice of spacetime), then 
in many cases W is the tangent bundle If ^ is interpreted as the material body, then W is usually 

the pull-back of the tangent bundle of the space manifold under the configuration mapping that embeds 
the material universe in space. This is the interpretation used in previous works (e.g., [5, 8]). In either 
case, a section of the bundle tt is interpreted as a generalized velocity field from the respective Eulerian 
or Lagrangian point of view. 
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the power of the force for the generalized velocity field w. Thus, for example, the 
boundary force is applied to the virtual velocity to produce the power flux density 
pertaining to the region under consideration. 

The Cauchy postulates for forces are analogous to those pertaining to the scalar- 
valued properties (see [8]) and will not be described here. Again, the Cauchy theorem 
asserts the existence and uniqueness of the stress object that induces the various 
boundary forces. The resulting generalized stress a is a section of the vector bundle 
L(W, i.e., the value of a stress at a point is a linear mapping that 

assigns an (m - 1) -multilinear mapping on the tangent space to any virtual velocity 
at that point. The resulting (m — 1) -multilinear mapping a (w) is interpreted naturally 
as the flux form for the power associated with the given generalized velocity. The 
Cauchy formula assumes now the form 

tnW = o 

and we will also write a (u;) for a o u; and C(cr) for o a. Thus, the Cauchy formula 
may be written as t'ji = L*(cr) in analogy with the scalar case. In other words, the 
power flux density associated with the generalized velocity m; at G dTZ is the 
restriction of the flux obtained from the stress. 

Using Stokes’s theorem, the action of may now be rewritten using an integral 

over TZ as 

FnM = I {dcr{w) 

Jn 

It follows from this representation that the power depends on both the values of w 
and its derivatives. 

Assume that {x\w^) are local vector bundle coordinates in a neighborhood 
7t~^{U) C W, U C ^ with local basis elements [ca] so that a section of W is 
represented locally by w^Ca. Then denoting the dual basis vectors by [e^], a stress a 
is represented locally by 

^a\ k ^ dx^ A- • • /\dx^ " Adx^ , 

where a “hat” ( ) indicates the omission of an item (an index or a factor) so that 

{dx^ A • • • Adx^ A • • • Adx^~^\ / = 1, . . . , m} 
is a basis for /\'”“"^(r*^). The value of o{w) is represented locally by 
a . t w^dx^ A- • • Adx^ A' • • Adx^ . 

7.6. Stress-energy tensors 

We now combine the points of view of the previous two sections. That is, we consider 
stresses for bodies whose material structure is induced by an extensive property p. 
Although we do not have a velocity field over a manifold for the case of such bodies in 
general, we can use the the flux form of the property as a generalization. In particular. 
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this is motivated by the fact that a volume element induces an isomorphism between 
the space of flux forms and the space of tangent vectors at each point. 

Thus, we consider stress theory on spacetime ^ where we set 

m-l 

IV = 

In addition, adopting the point of view of Section 7.3, we formulate the theory in 
spacetime so there is no need for the term containing hqi and it is omitted. Therefore, 
we write 

FnU) = I t7^(/), 

JdU 

where the boundary term i'ji is a section of (T* STZ)) . Thus, 

we conclude from the results of Section 7.4 that the stress or is a section of 

m— 1 m— 1 

The situation may be described generally as follows. We started with an ex- 
tensive property p given in terms of the flux densities for the various control 
regions TZ in spacetime. The source term for property piss and assuming the Cauchy 
postulates are satisfied the property p has a flux form 7. We now consider a second 
property, the q property, whose flux densities for the various control regions and 
source term satisfy the balance equation 




Again, assuming the Cauchy postulates hold for the property q, we have the corre- 
sponding flux form satisfying and the balance equation has the 

differential representation 

We will say that the property ^ is a resource for the property p if the flux density 
depends pointwise linearly on the flux form J of the property p. Thus, there is a 

section i'ji as above such that = tniJ)- 

In this framework, the Cauchy theorem implies that 

= tn(J) 

= t*(or(7)), 

for the inclusion i of an arbitrary region, so 7^^^ = o' (7). In other words, the value 
Gx of the stress at a point is the linear transformation that assigns to the flux of the 
property p the value of the flux of the resource q used by p. The source term for the 
property q is now given by s^^^ = da{J). 

Naturally, in what follows we will be concerned primarily with the energy 
resource. 
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7.7. Representation of stress-energy tensors 

Since the stress-energy tensor is now a section of 

m— 1 m— 1 

L(/\(r*^), /\(T*^)), 

it is locally represented with respect to a local basis of by a matrix. We 

will make below some further observations regarding the representations of stresses. 

Assume that a volume element 0 is given on Then we may use the vector 
bundle isomorphism 

m-l 

ie: 



to represent the section a of by a section or of 

L{T^ , T^) satisfying a oio = ie o a. 

Let us consider the relation between the local representation of a and the local 
representation a j dx^ <S> of a. To represent a locally, we will use the notation 

for the basis element dx^A • • • Adx^ A • • • Adx^ of Thus, the flux 

form J is represented locally by Jte^ , and the stress is represented locally in the form 
aJ ej (g) e ' , where {ej} is the dual basis to [e ^ }. 

If the volume element 9 is represented locally by r dx^ A — - Adx^, the action 
of ie is given locally by 



Jie^ Ji 

^ r 



1 . a 






(We use the summation symbol since the summation convention cannot be used on 
the right.) Thus, ie(cy{J)) is represented by 



“ r 




and d{ie(J)) is represented by 




I 



Hence, the relation between a and a is represented locally as 

It is interesting to note that the volume element does not enter the last relation 
and in fact we can write a natural isomorphism between the bundles L(/\'”“^ 

and L{T^ , T^) as follows. Consider the tensor product <8>^ 
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. This tensor product is naturally isomorphic to L(T^ For an element 
a = aj 0* 0 Vj in the tensor product 7*^ 0^ , vj e 7^^, 0* e T*^, set 

m— 1 m—l 

a: /\(r*^) ^ 
by 

cr(7) = ojvjj (<f>‘ A 7) (*) 

= a/(0'(i;^)y-<^' a(u,-j7)). 

We note that a is indeed linear in J. Since a depends linearly on the u* and on the 
0^ , it depends linearly on the elements of the tensor product. 

For the local coordinates {jc‘ }, let us determine the stress a induced by the 
linear mapping a e L(T^,T^) represented locally by the tensor a\dx' ^ . By 
definition, a{J) is represented by (the sum on i is explicitly written) 

tj (dx^ A 7) = ^ gjy-Tj (^dx^ A (Jk dx^A • • • Adx^A • • • A^jc'”)) 
i i 

= E ^ i ((- 1)' + * Ji dx^ A . . . Adx"') 

i 

= ^6r-J (— dx^ A - • • Adx^ A - • • AJjc'” 
i 

= Y^ai(-iy+jjiej 

i 

Hence, the matrix representing a is &j = (— 1)**^^ . We conclude that equation (*) 
is indeed the natural, invariant representation of the isomorphism 

m—l m—l 

L{/\{T*‘2^), /\ (T*^)) ^ L{T‘W, r^). 

This motivates even further the interpretation of the Cauchy stress as a transformation 
operating on the flux or velocity field of the property p to give the flux form for the 
energy or velocity of the generalized energy points. 

7.8. Electromagnetic stresses 

The theory of the Maxwell stress-energy tensor of electromagnetism does not follow 
usually the lines of stress theory in continuum mechanics. As an example for the use 
of the foregoing analysis, we can now present a definition of the Maxwell stress- 
energy tensor. Furthermore, while electromagnetism utilizes the metric of relativity 
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theory, we will assume here only the existence of a volume element on the four- 
dimensional ^ . The following setting is also independent of any relation between the 
Maxwell 2-form and the Faraday 2-form, such as the aether relations. The extensive 
property under consideration is of course the electric charge and J is the charge- 
current density — a 3-form. The conservation of charge implies that = 0 and the 
Maxwell 2-form 9 is a flow potential for the flux form so 7 = For a 1-form A, 
the vector potential, the energy source density is A A 7. It follows that the Faraday 
2 -form f = JA satisfies Jf = 0 . 

Thus, assuming that a volume element 0 is given on we set u = ie{J) and 
define the stress-energy tensor as the section a of 

m — 1 m— 1 

L(/\(r^), /\(r*^)) 

by (cf. [12, p. 36]) 

<^{j) = Af - (i^(7)jf) Afl. 

Alternatively, using 



A f) = (wjg) A f + g A (wjf). 



the definition of the electromagnetic stress-energy tensor may also be written as 
a (7) = ie{J)^ (fl A f) - 2{ie{J)^ f) A p. 



Note that the matrix of the stress with respect to the natural basis of the space of 
(m — l)-forms is related to the usual matrix of the stress-energy-momentum tensor 
as discussed in the previous section. 

Following the traditional formulations, we write the local representation f/y dx^ A 
dx^ of the Faraday 2-form f in the form 



{fo-} = 



{flo-} = 



0 


-Ex 


-El 


-El ' 


Ex 


0 


Bi 


-Bi 


El 


-Bi 


0 


Bx 


El 


Bi 


-Bx 


0 ; 


tation 


Qijdx' 


A dxj 


for the ' 


0 


Hx 


Hi 


Hi 


-Hx 


0 


Di 


-Di 


-Hi 


-Di 


0 


Dx 


-Hi 


Di 


-Dx 


0 



For simplicity of notation, we assume that locally the volume element 6 is of the 
form dx^ A • • • A dx^. Then m = ie{J) is represented by = (— 1)*"*"^7/. With this 
notation, the matrix } representing the stress-energy tensor is 
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where 












{a^} 



HiBi + H2B2 + H3B3 + DiEi + D2E2 + E3D3 
HE3H2 - E2H3) 

2(E3Hi - E1H3) 

2(E2Hi-EiH2) 

2(BsD2 — B 2 D 3 ) 

HiBi - H2B2 - H3B3 + E\Dx - E2D2 - E3D3 
2 {ExD2-B2Hx) 

2(£i£>3 + B3//1) 

2 {B 3 Dx-BxD 3 ) 

2{BxH2-E2Dx) 

-HxBx + H2B2 - H3B3 -ExDx+ E2D2 - E3D3 
2 (— £ 2^3 — B3H2) 

2{B2Dx-BxD2) 

2(,BxH3 - E 3 DX) 

2(-E3D2 - B2H3) 

—HxBx — H2B2 + H3B3 — ExDx — E2D2 + E3D3 



We now consider the energy source term d<r{J). Using u = ie{J) one obtains 

dor ( 7 ) = d^(je( 7 )j 0 ) Af - (le( 7 )jf) A0^ 

= d(«-i 0) A f - (m J 0) A df + (m J f) A d0 - d(UJ f) A 0 

= d(« J 0) A f + (m-i f) a 7 - d(w-i f) a 0, 

where Maxwell’s equations were used to arrive at the last line. Using the identity 
d(u-^ oi) = — w-J da, for any differential form a, we have 

da(J) = (wjf) A J + - u^dQ) A f - (^uf - u-idf) A g. 

Finally, since m j 7 = 0, Maxwell’s equations give for the energy source term 

dcx{J) = (M J f) A 7 + A f - (^„f) A 0. 



It is noted that the term (m_i f) A 7 represents the power of the Lorentz force. 
In addition, in the classical formulation where a metric is available and g = *f (* 
denotes the Hodge operator), the terms (-Sfufl) A f and A g are equal and the 
energy source density contains the power of the Lorentz force (and energy balance) 
only. For an analogous expression where the constitutive relation between g and f is 
not specified but a metric is used, see [4, p. 91]. 
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8. Algebra, Geometry, and Computations 
of Exact Relations for Effective Moduli of 
Composites 

Yury Grabovsky* 



Abstract. In this paper we will review and extend the results of [21], which cov- 
ered the case of 3D thermopiezoelectric polycrystals. In that context the settings 
of conductivity, elasticity, pyroelectricity, piezoelectricity, thermoelectricity, and 
thermoelasticity can be viewed as particular cases. We will consider a class of 
composites more general than polycrystals, where the set of allowable materials 
is not constrained in any way. In addition, the tensors of material properties are 
not assumed to be symmetric — an assumption we made in [21]. For example, the 
Hall effect for conduction in a weak magnetic field is described by a nonsymmet- 
ric conductivity tensor. We explain the step-by-step process of finding all exact 
relations for the simple example of the 2D Hall effect. The paper concludes with a 
discussion of new algebraic and geometric questions posed by the theory of exact 
relations. 



8.1. Introduction 

Composite materials play an increasingly important role in our everyday life and tech- 
nology. They are used everywhere from skis and golf clubs to sensors and actuators 
in high-tech components. By “composite” we mean a perfectly bonded mixture of 
two or more materials, where the mixing occurs on length scales much smaller than 
human size but much larger than interatomic distances. The physical properties of 
a composite (thermal, electric, elastic, etc.) are described by a tensor — the effective 
tensor of a composite. In order to create a composite with desired properties two 
basic problems become important: prediction of the effective properties of composite 
materials and determination of the effective tensor of a given composite by as few 
measurements as possible. 

The problem of analytical prediction of effective properties of composite mate- 
rials is important for both practical and theoretical points of view. Understanding the 
effective behavior mathematically may help save many costly and time-consuming 
measurements or may prevent spending time and money on many trial-and-error ap- 
proaches to material design. Recent years have seen a lot of success in that direction. 
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I would like to describe in this chapter a theory of exact relations for effective moduli 
of composites. The theory combines algebra, geometry, analysis, and mechanics in 
a beautiful symbiosis. It also achieves a high level of generality encompassing con- 
ductivity, elasticity, piezoelectricity, and many other coupled problem contexts. The 
theory and many of the physical results of this paper are also discussed in the new 
book by Milton [39]. 

The principal difficulty in the prediction of effective properties is the universally 
recognized fact that the effective tensors of composites depend on the microstructure 
(spatial arrangement of component materials) in general. Therefore, the object of 
importance is the set of all possible effective properties of a composite made with 
given materials taken in prescribed volume fractions (the so-called G-closure set). 
Unfortunately, aside from a few particular cases the G-closures are extremely difficult 
to compute analytically. 

Exact relations are dependencies between various material properties that “sur- 
vive” homogenization. For example, if we mix two isotropic elastic materials the 
resulting composite does not have to be isotropic. So, the relations defining isotropy 
are not preserved under homogenization. Yet isotropy survives if both isotropic ma- 
terials have the same shear modulus. In fact, the composite will also have that same 
shear modulus, no matter how the materials are mixed. Statements about effective 
tensors that do not depend on the geometric arrangement of the constituent materials 
are called exact relations. The example described above is the exact relation due to 
Hill [22, 23] in 1963. 

Exact relations and, more generally, G-closures provide information about all 
composites regardless of their origin, details, and complexity of microstructure. If we 
want to move beyond the G-closure and obtain a more detailed information about the 
effective tensors of composites we need to distinguish composites according to the 
type of microstructure they have. 

One class of composites is where the (possibly infinitely many) scales of in- 
homogeneity are well separated. For example, imagine a material that looks homo- 
geneous to the naked eye. Yet when we examine this material under a microscope, 
we may observe that it is, in fact, composed of several seemingly homogeneous 
materials, each of which, upon examination under a more powerful microscope, is 
observed to be composed of other seemingly homogeneous materials and so on for 
several or possibly an infinite number of steps. We also assume that on each step of 
our microscopic examination of an apparently homogeneous component, we find a 
rather simple geometric arrangement of phases, each occupying regions of approxi- 
mately the same size. This broad class of composites is adequately described by the 
homogenization theory (see, e.g., [5, 24]), which we briefly review in this paper. 

On the other end of the spectrum are disordered media where the microstructure 
at one point is almost uncorrelated to the microstructure at a somewhat distant point. 
In this situation it is usually assumed that the composite is random, and the stochastic 
approach [18, 30, 42] works well here. 

The third and very important type of composite has a power law distribution 
of sizes and properties of the constituents. The power law is the telltale sign of 
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self-organized criticality — a theory proposed by Bak, Tang, and Weisenfeld [3, 4]. 
According to this theory many open systems with constant influx of energy self- 
organize into poised states on the border between order and chaos. Examples include 
earthquakes and sandpiles, extinction of species and traffic jams [2]. I can speculate 
that the structure of real geological media is critically self-organized. The upper crust 
of our home planet can be considered an open system with constant energy flow from 
inner layers of Earth in the forms of seismic and volcanic activity. As a result the 
Earth’s crust is a complicated multiscale structure with inhomogeneities on a vast 
range of scales. Under the microscope we can see tiny particles making up clay. With 
a naked eye we can see small individual grains of sand. We can also see boulders — the 
heterogeneities are on the scale of meters. The scale staircase goes on and on. The 
Earth’s crust is a heterogeneous mixture of materials on a continuous spectrum of 
length scales from microscopic to global (continental plates, oceans). I would like 
to call such composites critical. The appropriate mathematical tools for a rigorous 
discussion of effective properties of such composites are only beginning to emerge. 
On the one hand, there is a nonrigorous renormalization group approach [27, 44]. 
On the other, there is a more rigorous micromechanics-hased approach leading to 
nonlocal constitutive laws [10, 11, 41]. Yet, hydrologists, for example, have to deal 
with the effective hydrolic permittivity of such media as a rule [8]. 

As we mentioned earlier, we will focus on the microstructure-independent aspect 
of the theory of composite materials, which applies to all composites equally well. 
The discussion of the different types of composites was needed in order to place the 
subject of this paper in a broader context. 



8.2. G-closures 

We start this discussion with the example of conductivity before going on to the 
abstract framework encompassing a variety of physical contexts. 

The conduction of the electric current is described by two fields, the electric field 
e and the current field j . The two fields satisfy differential constraints: The electric 
field is curl-free, the current field is divergence- free. The two fields are related via a 
tensorial Ohm’s law. Thus, we have the following equations: 

V X e = 0, V • 7 = 0, 7 = ae, (1) 

where the 3 x 3 matrix a is symmetric and positive definite. 

An n-phase composite made with n materials <r i , . . . , is a Borel measurable 
matrix field <r(jc), such that a(x) e D = (ai, . . . , a„). An n-phase polycrystal is a 
Borel measurable matrix field a (x), such that a(x) e D = IJ SO(3)-aj,In general, 
we fix a set D (finite or infinite) of materials and consider the set 

V = {a(x) e L^(Q) | a(x) e D a.e. x € Q], (2) 

From the applied point of view one needs to solve the elliptic boundary value 
problem in a domain Q occupied by the composite 
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I V • (or(x)V</>) = /, xeQ, 

[0 = 0, xedQ. 

However, if a{x) has a very complicated geometry, the numerical solution of (3) 
is not feasible. The fruitful idea is to represent the local conductivity tensor a(jc) 
as a member in a sequence a^(jc) for small €. As € 0 the length scales in the 

microstructure become more and more separated. In the limit as € ^ 0, the material 
properties are described by a homogenized tensor a*(x), which may be much simpler 
than the local conductivity tensor. This is especially true in the case of composites 
with inhomogeneities on well-separated length scales. If <r*(jc) is particularly simple 
then (3) may become solvable numerically. 

Putting the practical questions aside and turning to mathematical justification 
of the homogenization procedure, we need to answer the following question. Given 
a sequence of bounded measurable functions cr^(x), is there a sense in which we 
can say that -> a*, so that the solutions 0^ of (3) with a = cr^ converge to the 
solution 0* of the homogenized equation? Such a notion was proposed by Spagnolo 
[45, 46] and further developed by De Giorgi and Spagnolo [7]. Murat and Tartar 
[40] (the English translation of the French original) extended G-convergence to the 
case of nonsymmetric tensors a and proved that G-convergence of symmetric elliptic 
operators in (3) implies the convergence of fluxes 

a"V0^ <r*V0* (4) 

weakly in 

The nonsymmetric tensors of material properties may arise in a variety of con- 
texts. For example, the Hall effect in the electric current conduction in a very weak 
magnetic field is governed by the same basic equations of conductivity (1), except 
the tensor a is no longer symmetric 

a = -h7T(r), 

where as is the symmetric and positive definite conductivity tensor, r is the Hall 
vector, and tt is the “cross-product” mapping between vectors in and 3x3 skew 
symmetric matrices such that for any {a, jc} c we have 7t{u)x = u x x. 

Following Murat and Tartar, the name H-convergence is attached to the kind of 
G-convergence that is appropriate for nonsymmetric material tensors. 

Definition 1. A sequence of not necessarily symmetric tensors a^ H-con verges to or* 
if for any / in (3), 

(i) 0^ ^ 0* weakly in Hq(Q), 

(ii) a^Vcj)^ a*V0* weakly in L^{Q). 

Murat and Tartar [40] noticed that for symmetric matrices a the second condition 
is redundant. It follows from the first. 

The key result here is the compactness theorem proved by De Giorgi and Spag- 
nolo for the case of symmetric a and by Murat and Tartar in general. The theorem 
states that any sequence of uniformly positive definite and bounded matrix fields 
a^(x) contains an H-con vergent subsequence. In this connection the problem of the 
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closure of the set V, defined in (2) with respect to the H-convergence, becomes impor- 
tant. In order to formulate a fundamental result, we must first give the basic nontrivial 
example of an //-converging sequence . Suppose that the matrix- valued function 
<r {y) is periodic with a parallelepiped of periods 2 = [0» 1 ]^ • We further assume that 
a{y) is uniformly positive definite and uniformly bounded. Then cf^{x) = <r(x/€) 
H-converges to a constant positive definite matrix or*, defined via a solution of the 
so-called periodic cell problem. We will write the cell problem in a form that will be 
useful later: 



V-7=0, Vxe = 0, j{y) = (T{y)e{y), (5) 

where all functions and differential operators are g-periodic. Equations (5) have a 
unique solution if we fix the mean value of the electric field e over the period cell. 
Suppose {e) = e*, where (•) denotes the average over the period cell Q. Then the 
unique field j satisfying (5) depends linearly on e*, and therefore there exists a 3 x 3 
matrix a* such that 

= (7). (6) 

Definition 2. Let D be the set of matrices representing conductivities of the materials 
constituting the composite. Then the G-closure [31, 49] G{D) of the set of materials 
D is the set of all effective tensors <r* of periodic composites made with materials 
from the set D. 

Kohn and Dal Maso realized that the set G{D) is sufficient to describe the H- 
closure (closure with respect to the H-convergence topology) of the set V defined 
in (2). The result has been recently rigorously proved in a very general context by 
Raitums [43]. The theorem states that the matrix field a*(jc) is in the H-closure of V 
if and only if a*(x) e G(D) for a.e. x e Q. 

Our primary interest is the G-closure sets of materials in a variety of physical 
contexts. We must mention that the explicit form of the G-closure is known only in a 
very few cases [15, 19, 20, 29, 32, 33, 34, 35, 37, 49]. Only the paper [15] characterizes 
G-closure sets for arbitrary sets of materials D for 2D conducting polycrystals. 

At this point, we would like to make several remarks. Observe that the G-closure 
problem is the problem of computing a subset in a finite dimensional space, given 
another subset of the same space. Yet, it involves solving a periodic PDE problem 
(5). It would be nice to have a completely geometric description of how to construct 
G(D) knowing D, as was done in [15] for 2D conducting polycrystals. The first step 
towards such a geometric description in the general case was made in [16] for sets 
closed under lamination (the cornerstone of this paper), but a major new geometric 
breakthrough is still needed for the method to be truly useful. 

Another remark is that G-closure lives in a rather high dimensional space (6D 
for conductivity, 9D for the Hall effect). Therefore, even if we have exact formulas 
for the boundary of G(D), they are not immediately useful in applications without 
further (often very complicated) algebra. Instead, a more useful goal would be an 
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efficient numerical algorithm for computing quantities like 

fo = mn f(a) 
aeG{D) 

for a class of functions /. For each / the answer is a number fo — something an 
engineer can relate to. 

Getting closer to the subject of the paper, there is a fundamental dichotomy in 
the shapes of G-closure sets. Most G-closure sets (aside from single points) have a 
nonempty interior. In exceptional cases the G-closure sets lie on surfaces (of various 
codimensions). It is these exceptional cases that we call exact relations and it is these 
cases that we are after. The trick here is that we are no longer in the losing game of 
computing the boundaries of the G-closure sets. Instead, we just want to identify the 
surfaces that contain G-closure sets. To give a trivial example, consider the Hall effect 
and assume that we are mixing materials whose Hall tensors are all zero. From the 
physical point of view, it is obvious that the Hall effect will not arise in a composite if 
every component does not have it. Geometrically, a 6D surface (all 3x3 symmetric 
matrices) in a 9D space (all 3x3 matrices) is an exact relation. In identifying this 
exact relation we do not need to compute any G-closure sets; yet the result is not 
entirely devoid of useful information. 



8.3. Hilbert space formalism 

Milton [38] has observed that the periodic cell problems in various physical contexts 
follow the same abstract pattern, which is best described using the language of Hilbert 
spaces (see also [9, 1 8, 25, 28, 42, 50] for similar formal approaches). Milton observed 
that regardless of the particular physics of the problem there are two fields: the inten- 
sity field E and the flux field J. They take their values in a certain finite dimensional 
tensor space T. For the Hall effect example, the electric field e is the intensity field 
and the current field j is the flux field (in fact, the names are purely formal, since the 
theory is symmetric with respect to the swapping of the fields). The fields are vectors 
so that T = for the Hall effect. All intensity fields and fluxes will be assumed to 
belong to the ambient Hilbert space H = L^{Q)^T. The most important observation 
of Milton concerns the general structure of the differential constraints satisfied by the 
fields E and /. If we denote by W = R 0 T the subspace of constant fields in W, then 
the Hilbert space H is split into the orthogonal sum H ^ ® J ®U, where S and J 

are the subspaces of mean zero intensity fields and fluxes, respectively. The subspaces 
E and J correspond to the differential constraints on E and /, respectively. For that 
reason Milton required that the orthogonal projection F onto £ be local in Fourier 
space. In other words, there is a degree zero function r(k) such that for each k eS^ 
the matrix F (k) is an orthogonal projection onto a subspace £k of T and such that 

[0, ifjfc = 0. 

The subspaces £k cannot be arbitrary. For example, they have to be of the same 
dimension. The basic 0(3) symmetry of our space implies that the subspaces £k are 
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permuted by rotations. In other words, the vector space T is not just a linear space, 
it is a representation space for the group 0(3). For t G T we denote the action 
of R e 0(3) by /? • T. With this notation we can write the basic property of the 
subspaces Sk'. R • £k = We use the same notation for the action of rotations on 
End(T), the space of linear operators on T. For any A G End(T) and any t g T 
we define R • Aby {R • A)r = R • (A(R“^ • t)). With this notation the basic 
property of the subspaces £k can also be written sls R • V(k) = r(Rk). This last 
formula tells us that the function F(k) is uniquely determined by a single matrix 
To = F((l, 0, 0)), which in turn is uniquely determined by a subspace So = f(i,o,0)« 
The only restriction on this subspace is that the subgroup O (2) of 5 O (3) that leaves 
the = (1, 0, 0) direction invariant should leave the subspace £q invariant. Thus, a 
standard representation theory of 5 O (3) can give us a list of all possible subspaces Sq 
satisfying all of our constraints. For example, for T = M^, there are just two proper 
subspaces Sq satisfying our constraints. One is R^i, and the other is its orthogonal 
complement. Thus, for k e there are two choices for the function T{k). Either 
r (fc) = ifc 0 A: or r (A:) = / — A: (g) fc, where / is a 3 x 3 identity matrix. The first choice 
corresponds to the equations (5), and so does the second, with e and j interchanged. 
Thus, in the context of conductivity, equations (5) are the only possibility consistent 
with a general Hilbert space framework described here. 

The composite microstructure is given by an mapping L(x) of Q into 
End(T). The function L(x) can also be viewed as an operator L mapping H = 
L^{Q) 0 T into H\ for any / g 7Y we have (L f){x) — L{x) f(x). 

The effective tensor L* is defined by analogy with (6): 

r = L*E\ ( 7 ) 

where E* = (E) and /* = (/). 

8.4. Lamination formula 

In this section, we derive a lamination formula that has the form of convex combi- 
nation. This formula was first derived by Milton [38] and independently by Zhikov 
[51]. Other linear lamination formulas were derived by Backus [1] and Tartar [48] 
for elasticity. Their idea was to rewrite the constitutive relation so that the continuous 
and discontinuous components of the elastic fields are separated. The laminate of two 
materials L\ and L 2 with layer normal n and volume fractions 0\, O 2 (^1 + ^2 = 1) 
is a periodic stricture with the period cell shown in Figure 1. 

More generally, a laminate is a function L(y) that depends only on y • n . In order 
to formulate the theorem we introduce the W- transformation of Milton [38]. Let Lq 
be a reference medium that is assumed to be positive definite but which otherwise is 
completely arbitrary. Let r'(w) denote the nonorthogonal projection onto the subspace 
LqS„ along the subspace J„. In fact, the projection F'(w) is well defined as long as 
Lo is positive definite on £„ (to ensure that LoS„ and have trivial intersection). 
One can check that 



r'(n) = Lo(/ - r(fi) + r(n)Lo)"'r(n). 
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First, define the S-transformation 

5(L) = (/-LLoi)-i. (8) 

Now define 

w„{L) = [S(D - r'(/i)]-‘ . (9) 

Theorem 1. Let L{y) be the laminate with the normal n and let Lq be an arbitrary 
positive definite reference medium (positive definite on £„ is enough). Then 

(W,(L(y))) = Wn(L% 

The proof is analogous to the proof of the corresponding theorem in [21, The- 
orem 3.1] for symmetric matrices. 

Proof Taking the average of the polarization field P = (L — Lq)E we obtain 
P* = (L* — Lq)E*. These relations can also be written using the S-transformation 
defined in (8): 

LoE = -S(L)P, LoE* = -S(L*)P\ (10) 

Applying the projection operator F' to P we obtain 

rfn)(P - P*) = -Lq(E - £*). 

Substituting the values for LqE and LqE* from (10), we obtain 
r'(it)(P - P*) = S(L)P - S(L*)P*. 

Solving for P and using the definition of the W-transformation, we obtain 
P{y) = W„(L(y))iW„(L*))-^P*. 




8. Exact Relations 



175 



Taking averages, we get 

P* = (W„(L(ymW„{L*)r^P*. 

Theorem 1 now follows because the constant field P* can be arbitrary. 



□ 



For the Hall effect £„ = Rn and the requirement that Lq is positive definite 
on £„ = R#i for all n is equivalent to Lq being positive definite. (For elasticity the 
corresponding condition is equivalent to the Legendre-Hadamard condition for Lq.) 
In this case, 



Lon • n 



( 11 ) 



The W- transformation maps the classical lamination formula [17, 49] into a 
convex combination. Namely, if L* is an effective tensor of a laminate made with 
materials L\ and L2 taken in volume fractions 0\ and 62 with lamination normal 
/I, then 



W„(L*)=0iWn(Li)-^02W„(L2). . ( 12 ) 



A corollary is that for any direction n a W;, -image of any set stable under 
lamination must be a convex set. The idea to use this property to study the geometry 
of sets stable under lamination is due to Francfort and Milton [16]. 



8.5. The main ideas 

Theorem 1 implies that if a set G is G-closed then W„ (G) is a convex set for all n eS^. 
If, on the other hand, W„(G) is a convex set for all n eS^ then the set G is L-closed 
(or stable under lamination). In almost all cases, G-closure and L-closure coincide; 
however, there is an example by Milton of a set that is L-closed but not G-closed 
[39]. In any case, from the practical point of view L-closure would provide a very 
good approximation to G-closure. Thus, the geometric problem of finding the smallest 
set G{D) containing D and such that Wn{G{D)) is convex replaces the problem of 
solving the cell problem. In the important poly cry stalline case, the geometric problem 
admits a very attractive formulation. (Is there an attractive answer?) 

Assume that if a material L is in D then all of its rotations {R L I « € 
5G(3)} are also in D. If we choose an isotropic reference medium then we can easily 
verify that 

R-W„(L) = Wr„(R-L). (13) 

It follows, then, that if a set G C End(T) is rotationally invariant and W„(G) is 
convex for a single choice of n then W„{G) is a convex set for all n e S^. Thus, 
we can fix one unit vector /i, for example, n = (1, 0, 0), and consider just one map 
W(L) = W„{L). The question is this: 

Find the smallest rotationally invariant set G(D) containing a given 
set D such that W(G(D)) is convex. 

We can ask a different question: 




176 



Yury Grabovsky 



Can we characterize convex functions g(K) such that the functions 
/(L) = g(W{L)) are rotationally invariant, f(R • L) = /(L) for 
all 50(3)? 

This question sounds very similar to the one answered by Chandler Davis [6]. Davis 
proved that a rotationally invariant function defined on hermitean matrices is convex 
if and only if its restriction to the diagonal matrices is convex. The method of Davis 
does not apply to our problem because the action of 50(3) on the W variables is no 
longer linear. Yet, a similar result is hoped for. We conjecture that there is a subspace in 
End(T) transversal to the action of the rotation group 50(3), such that the convexity 
of the restriction of g to this transversal is equivalent to the convexity of g. (Of course, 
the restriction of g to the transversal has to be invariant under the action of the finite 
Weil group S 3 of 50(3).) It is our guess that the geometric results of [15] can be 
reformulated in terms of convexity in the transversal. 

8.6. Exact relations 

We now turn to another implication of convexity discussed in Section 8.5 above. If 
we are searching for submanifolds M of End(T) corresponding to exact relations, 
then convexity of the submanifolds n„ = W„(M) is equivalent to saying that the II„ 
are affine subspaces of End(T). Observe now that Wn(Lo) = 0. Thus, choosing Lq 
to lie on M we make sure that n„ is a subspace for each n. The following theorem 
gives necessary conditions for M to be an exact relation. 

Theorem 2. Suppose the submanifold M is an exact relation. Then the subspaces 
Yin = W„(M) do not depend on n. Moreover, this single subspace 11 is closed with 
respect to the family of Jordan multiplications defined by 

Ki*aK 2 =^-(KiAK 2 + K 2 AKi), (14) 

where A can be any matrix from the subspace 

^ = Span{r'(/i)-r'(ci) |« eS^}. (15) 

The proof of this theorem follows word for word the proof of the corresponding 
theorem in [21, Theorem 3.5] for the symmetric case. From now on we will say that 
a theorem is proved in [21] if the proof for the general case is the same as for the 
symmetric case with obvious modifications. 

Another important question is about stability under homogenization. To this 
end, in [21], we have derived a formula for the effective tensor L*. 

Theorem 3. Let L* be the effective tensor for the composite with the local tensor 
L{x). Then for any n e 

W„{L*) = ((I - W„A„)-'W„(L(jc))), (16) 

where I denotes the identity operator on the Hilbert space HI = L?{Q) 0 End(T), 
W„ denotes the multiplication operator on H: (W„^)(jc) = Wn{L{x))H{x), and 
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A„ is defined by AnH{k) = A„{k)H{k), where 



An(m) = 



- n«), ifm^O, 

0, ifm = 0. 



(17) 



Using this formula, we can prove the following rather messy necessary and 
sufficient condition for a subspace O to correspond to an exact relation. 

Theorem 4. For v = (/i , . . . , /j^) € (Z^)^ and for a € the permutation group of 

k elements, we define 

o^(v) = aa(i),...,W))e(ZY- (18) 

Let 0(v) = {a(v) 6 (I?)^ | a e Sk) be the orbit of v under the action of the 
permutation group Sjc. The sub space O C End(T) corresponds to an exact relation 
if and only if for any k eN, any n e S^, any v = {l\, . . . , Ik) e (Z?)^ such that 

k 

5Z/.=0, (19) 

i=l 

and for any function K 11, we have 

I f[ KipMn ( E I I KiPk) 6 n, (20) 

(Pl,...Pk) eO(v) \s=l y=l // 

where A„ is defined in (17) above. 



The analogous theorem was formulated in [21, Theorem 3.9], where some steps 
of the proof were indicated. Unfortunately, the formulation of the theorem was not 
entirely correct. The sum in the formula [21, (3.38)] corresponding to (20) extended 
over the set Sk of all permutations instead of the elements of the orbit 0(v). In [21] 
the theorem was not used anywhere else and was included only for the purposes of a 
discussion. Here we give the correct formulation and a complete proof of the theorem. 



Proof The proof is based on the formula (16) and the analyticity properties enjoyed 
by the effective tensor. In [21, Appendix] we have shown that the map W„ is an 
analytic diffeomorphism, defined everywhere on the set of positive definite matrices. 
Therefore, all exact relation manifolds M, being analytic images of subspaces II, are 
analytic. Also, if we take an analytic family Lx(x) of local tensors, it will generate 
an analytic family of effective tensors [18]. The principle of analytic continuation 
implies that if 6 M for some small interval of A then e M for all A in the interval 

of analyticity of L^. This argument shows that if we want to prove that M is an exact 
relation, then it is enough to show that M satisfies necessary conditions of Theorem 2 
and that for one choice of w 6 and for any W(x) = Wn(L(x)) e FI sufficiently 
small, we have W* = W„(L*) € fl. We can ensure that W(jc) is sufficiently small if 
we choose the values of L(jr) sufficiently close to the reference medium Lq e M. 

Let He = {K\ + iK2 \ {K\, K 2 ] C 0} be the complexification of FI. The 
analytic continuation principle also implies that if FI corresponds to an exact relation 
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then so does Uc in the sense that if W(jr) e He then also W* e He, where 
W* is computed from W{x) by the formula (16). The argument is due to Milton 
(private communication). Let W;^(jc) = Wi(jc) + XW 2 (Jc), where Wi(x) e FI and 
W 2 (x) e n are assumed to be small enough. Then, since 11 corresponds to an 
exact relation we conclude that for real values of k in some finite interval, we have 

e n. Let P e End(T) be such that Tv(PK) = 0 for all K e U. Then the 
function f{k) = TriPW’^) is analytic in k and is zero on an interval on the real axis. 
Since W is assumed to be small, the region of analyticity of f{k) in the complex 
plane includes k = i. Thus, we conclude that Tr(PW*) = 0 for all real matrices P 
such that Tr(PK) = 0 for all liT 6 0. But 

Uc = {K e Endc(T) | Tr(PK) = 0 VP : Tv(PW) = 0 VW e U], 

Thus, W; € Oc. 

If W(x) is complex valued and sufficiently close to zero, then we can expand 
(16) in a convergent power series: 

W* = (W(AT)) + (WA„W(Ar)} + • • • + ((WA„)*W(a:)) + • ■ • , (21) 

where W is the operator of multiplication by W(x). It follows from the argument 
above that M is an exact relation if and only if each term of the expansion (21) 
belongs to He, when W(x) e ri(C is small enough. 

LtXTkix) = (WA;,)^W(x). Writing Tk{x) = W(x)(A„rjt-i)(J^) recursively, 
then taking the Fourier transform and using induction in fc, we can prove that 

fk(m) = (n ( LI 

h + • =m \s=\ J 

where m,lj e 7?. Thus, we get 

mA„fw{x)) = f *(0) = ( n I E 1 1 (22) 

^1+ • + /it=0 V=1 \; = 1 // 

Observe that the sum in (22) can be split into parts. The summation in each part 
goes over all the distinct permutations of the same set of vectors Ij. In order to say 
this more rigorously we define the action of the permutation group Sk on (Z^)^ by 
(18). Thus, (Z^)^ splits into the disjoint union of orbits of the group action. Let Z 
denote the set of orbits whose elements satisfy the constraint (19) (invariant under 
the group action). Then we can write 

= E E I n ( E 1 

0€2(/i,...,;*)€ 0 \i=i \;=i // 




( 23 ) 
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If M is an exact relation then we can choose an arbitrary ^-tuple (l\, . . . Jk) ^ 
satisfying (19) and let 

k 

7 = 1 

where {K\, . . . , Kk) C O are arbitrary. Then we will obtain (20). Conversely, if (20) 
is satisfied for all {A'l, . . . , C O then, by the analytic continuation principle, it 
is satisfied for all {K\, Kk) C He. Thus, each term under the exterior sum in 
(23) is in flc- Therefore, W* e Uc and is real, so W* e FI. The theorem is proved. 

□ 

The above theorem does provide algebraic conditions for O that guarantee 
that n corresponds to an exact relation. The practical utility of such a theorem is 
minimal, since it is virtually impossible to check infinitely many conditions that are 
as complicated as (20). Therefore, in [21] we proved a convenient sufficient condition 
for n to correspond to an exact relation. In order to formulate it we introduce the 
following terminology. The expression K\A\K 2 A 2 K 3 A 3 . . . K j-\Aj-\Kj will be 
called a y -chain. We say that a subspace II satisfies a 7 -chain property if for every 
{K \, . . . , Kj] C n and every {Ai, . . . , Aj-\] C A we have 

K 1 A 1 K 2 A 2 . . . Kj-iAj-iKj + KjAj-iKj-i . . . A 2 K 2 A 1 K 1 e 0. (24) 

We remark that if II is a subspace corresponding to an exact relation then Theorem 2 
says that II satisfies a 2-chain property. Now the analogue of the sufficient condition 
established in [ 21 ] can be formulated as follows. 

Theorem 5. If Tl satisfies the j -chain property for j = 2, 3, 4, then II corresponds 
to an exact relation. 

Proof In [21, proof of Lemma 3.7] we proved that if II satisfies the 7 -chain property 
for 7 =2,3,4, then O satisfies the 7 -chain property for all 7 > 2. It remains to show 
that if n satisfies the 7 -chain property for every 7 > 2 then ( 20 ) will be satisfied. 

Let T] be the permutation defined by ^ 7 ( 7 ) = /: 4- 1 — 7 . The permutation t] is 
an element of order two in the group Sk. It acts on 0(v) and splits 0(v) in a disjoint 
union of orbits. If an orbit contains two elements then their sum has the form (24). If 
the orbit contains a single element then this element is 1 /2 of the sum of two copies 
of itself, which is again of the form (24). □ 

In two space dimensions, however, we can say a little bit more. 

Theorem 6. In two space dimensions the 3-chain property is necessary for stability 
under homogenization. 

Proof. We will show that in 2D the necessary and sufficient condition (20) from 
Theorem 4 for = 3 implies the 3-chain property. 

Let n = (1,0). Fix N e Z \ (O). Choose l\ € to be linearly independent 
with n and define I 2 = —Nn — l\, so that I 3 = Nn. Observe that l\, I 2 , and I 3 are 
distinct and that according to (17), A(/s) = A(l\ + I 2 ) = 0. For simplicity we use 
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A(-) notation instead of since n = (1, 0) is fixed. Condition (20) for /: = 3 
then becomes 

K 2 A{ 1 \ + Nn)K 2 ,A{l\)K\ K\A{l\)K 2 tA{l\ + Nn)K 2 € 0 

for all {if 1 , A^ 2 , C 0. Now, if we vary N then we observe that the vectors 
(db(/i + Nn)/\l\ -f Nn\ : N eZ] form a dense subset of the unit circle. (The same 
construction does not yield a dense subset of the unit sphere in 3D.) Therefore, by 
continuity, we conclude that for any {^i , ^ 2 } ^ ^^^ve 

K2A(q2)K3A(qOKi + if 1 A(^l)^3A(^2)^2 € U 

for all {K\, K 2 y K 3 ] C O, which easily implies the 3-chain condition. □ 

We remark that in all the examples that we have worked out so far our sufficient 
condition was satisfied whenever the necessary conditions in Theorem 2 were. 

We would like to conclude this section by stating the results and problems 
obtained in [21] for the symmetric case. For a subspace X of End(T) let A'sym denote 
the set of symmetric parts of matrices in A'. If O C Sym(T) then Theorem 2 can be 
restated as follows. 

Theorem 7. If the subspace FI C Sym(T) corresponds to an exact relation, then it 
satisfies 

(nAn)sym c n. (25) 

The sufficient condition for Yl to correspond to an exact relation stated in The- 
orem 5 can be cast into a more attractive algebraic form. Let O' c End(T) be the 
smallest associative algebra containing the Jordan algebra FI. In other words FI' is 
the smallest subspace in End(T) containing FI that satisfies 

n'An' c n'. ( 26 ) 

Theorem 5 can then be restated as follows. 

Theorems. If the subspace Yl C Sym{T) solves {25) and has the additional property 
that n = n'yjjj, then O corresponds to the exact relation. 

The important open question is whether every Jordan algebra n (understood 
in the sense of (25)) is the set of all symmetric matrices of the smallest associative 
algebra Yl' (understood in the sense of (26)) containing 11. The question, in other 
words, is “Are sufficient conditions necessary?” 



8.7. Finding exact relations 

The preceding section established simple algebraic conditions that the subspace FI 
has to satisfy in order to correspond to an exact relation. Now the question is: Can 
we characterize all solutions FI of 



K 1 AK 2 + K 2 AK 1 e n. 



(27) 
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for all {jfiTi, 1(^2} C n and all A e A1 Unfortunately, at present we do not have an 
efficient way to solve equation (27). In this section, we will discuss an inefficient 
way of solving (27) for the 2D Hall effect. The method simply consists of picking a 
matrix and computing by brute force the smallest Jordan algebra FI containing that 
matrix. Then we compute all other Jordan algebras extending the smallest one, again, 
by brute force. This method is applicable to rather small dimensional cases (or small 
block-dimensional as in [21]). 

For the 2D Hall effect, the space T = Let 



Lo = oro + ocS 



(28) 



be the positive definite reference medium, where ao is the symmetric part of Lq and 



5 = 






(29) 



We recall that for the Hall effect the matrix r'(/i) is given by (11). Therefore, the 
subspace A defined in (15) is 

A={Ae End(R^) 1 Tr A = 0, ALJ = LqA^}. (30) 



8.7.1. Solving (27) 

Observe that equation (27) behaves nicely with respect to the following “change of 
variables.” Let X and Y be invertible matrices. Let H = XTIY. Then fl solves (27) 
if and only if U solves (27) with A replaced by A = Y~^AX~^. Applying this 
observation to 



X = Y = LoX, (31) 

we obtain that A is the space of trace- free symmetric 2x2 matrices. From now on we 
will work only with subspaces A and O, and therefore we will rename them A and fl, 
respectively, for notational convenience. We will return to the original notation when 
we have solved equation (27). Thus, A now denotes the subspace of 2 x 2 symmetric 
trace-free matrices. 

Observe that for X = R and Y = , where is a 2 x 2 rotation matrix, the 

equation KAK C fl, if e FI remains invariant. Now fix ^ e Fl. We can find a 
rotation R e 0(2) such that 



R K = Ko = p 



1 P 

—P a 



(32) 



Thus, without loss of generality, Fl contains a matrix Kq of the form (32). We have 




1 (1 + ^2)^ (a + p^)t + p(l-a)s] 

-P a\ \_{(x + p'^)t - P(\ - a)s -ipc^ + P^)s \' 

(33) 
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Case I: There exists ^ H with a + 7 ^ 0. In this case, we easily see from 

(33) (setting s = 0, t = 1) that 



Consequently, 



roiir^ t Ifoii^f-^r 
[1 oj [r [1 oj ■" [ r 



implies that A C Tl. Simple manipulations show that the right-hand side of (33) is 
equal to (1 — a)sKo mod A. Thus, we get that FI contains the subspace A and the 
matrix 

,_r(l+a)/2 p 1 

«o-[ (l + a)/ 2 j- 

Case 1(a): There exists /Fq € O such that a ^ —1, in addition to ot ^ Q. 
Then we infer that 

where p' = 2j3/(l -f a). Again an easy calculation shows that 
is equal to 

2{sx + ty) ^ j (mod A). 

Thus, the minimal subspace O satisfying (27) in Case 1(a) is a 3D subspace 

n = ^©| 'J'ji. (35) 

Since this subspace is of codimension 1, there are no larger proper subspaces con- 
taining n. We note that (35) gives a one-parameter family of solutions fl, labeled by 
p' eR. 

Case 1(b): a = —I for all Kq € 11. Then, returning to (34) we obtain that 

r 0 „ 



is equal to 



2{sx -h ty) 



(mod A). 



Thus, the minimal subspace O satisfying (27) in Case 1(a) is a 3D subspace 



n = 



Now if all iTo € n are symmetric, then = 0 for all iTo € fl and II = ^ or 
n = Sym(R^) — the space of symmetric 2x2 matrices. 

If there is ^ H which is nonsymmetric, then 

n = ^© (36) 

This subspace is of codimension 1 and therefore is not contained in any other proper 
subspace. 
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Case II: a = for all Ko e II. In this case, we easily see that 

n = [_V40® 

is a one-parameter family of ID solutions of (27). Now we would like to see if there 
are other solutions IT containing (37). Let K\, not a multiple of Kq, be in O. Then 
for all A. € R the matrices K(k) = K\ -{■ XKq belong to fl and to Case II (we have 
already analyzed Case I). Thus, K (k) can be reduced to the form (37) with a different 
P by 2 i rotation and scaling: 

K(X) = 1^ -Pikf ] 

It will be convenient to represent K\ in the form K\ = P yS, where P is a sym- 
metric matrix and S is given by (29). Taking the antisymmetric part in the formula for 
K{X) we obtain ^{X)P{X) = y + XP, while taking the determinant of the symmetric 
part of K{X) we get 

det P + X(p22 - P^Pn) - = -iiixfpikf, 

for all A. € M. Thus, we obtain the following equations for y and for the components 
Pij of P: 

P11P22 - Pi2 = P22- P^pn = -2yP. 

Eliminating p 22 we get (puP — y)^ = Pj 2 - So either 

P \2 = PnP - Y (38) 



or 



Pl2 = Y - PllP- 

Assume first that K i satisfies (38). Then 



(39) 



Ki = 



Pn PnP 

PUY - 2 k pnP'^ -2 yP 



Now observe that 



^1 — Pi\Ko = 2{pnp 









Notice that if y = p\\P then K\ = p\\Kq in contradiction to our assumption that 
K\ is not a multiple of Kq. Thus, we conclude that 



n,=Spanl[""],[Jg])cn. 



(40) 



One may easily check that Hi is a one-parameter family of 2D solutions. 

Making a similar analysis for case (39), we obtain a solution fl 2 C FI, where 



ri 2 - Span I Q . 0 } ■ 



(41) 
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If n contains FIi but is larger than II i then our previous analysis shows that € 11 
must be either in II i or in U 2 > But Hi |J 02 is not a subspace. 

In summary, a proper subspace O is a solution of (27) if and only if it is a 
rotated image of either A, Sym(R^), (35), (36), (37), (40), or (41). We can now state 
the result. 

Theorem 9. If a proper subspace O in the space of all 2x2 matrices satisfies (27) 
with A being the space of symmetric, trace-free 2 x 2 matrices, then O is a subspace 
from the following list. 

1 . Sym(R^) — the space of symmetric matrices; 

2 . Oo — the space of trace-free matrices; 

3 . A — the space of symmetric, trace-free matrices; 

4. 

n^ = {[“ : a + rf = ^(*-c)}, ^5^0; 

5. Oa = {v (g)a I V € R^}; 

6 . nj; 

7. Y\a,h = R(« ®b). 

Observe that in our list items 1 and 2 are limiting cases of item 4: O^ ^ Oq, 
when p 0 and O^ Sym(R^), when oo. We also observe that A is the 
intersection of and O^ for any ^ ^2- Also 0^, ^ z= O^ P| Oj. Thus, we need 
to focus only on three subspaces from items 4, 5, and 6 . 

8.7.2. Checking sufficient conditions 

According to Theorem 5 and our discussion above we need to check the 3- and 4-chain 
property of subspaces FI from items 4, 5, and 6 in the list above. Indeed, the intersection 
of subspaces satisfying these properties must also satisfy them. The subspaces Fla 
and FI J are closed with respect to associative multiplication: ^ 1 ^X 2 € FI whenever 
[K\, K 2 } C n. Thus the 7 -chain property is clearly satisfied for those subspaces. 
The actual checking needs to be done only for the subspaces O^. The checking can 
be easily done with Maple — the symbolic algebra package. And indeed, we find that 
does satisfy 3- and 4-chain properties. 

8.7.3. Returning to L variables 

Now that we have a list of subspaces 0 corresponding to exact relations, we need to 
return to L variables. At the first glance the task before us is simply to compute 

M = {Lo - [/ + i:r'(n)]~‘ KLq \K eU), (42) 

which is the inverse of W-transformation. However, we quickly realize that in order 
to get explicit results, the computation in (42) is not so easy. Fortunately, we can often 
simplify our job and sometimes avoid it altogether. One obvious observation that we 
have already made is that if we have computed two exact relations then there is no 
need to compute their intersection. 




8. Exact Relations 



185 



In [21] we identified an especially simple class of exact relations: uniform field 
relations (UFR). In general these are defined as 

M = {L € End(T) | Lai = 6i, . . . , La, = (43) 

for fixed uniform fields {ai, . . . , a,, . . . , C T. These are easily recognizable 

at the level of subspaces fl. 

Theorem 10. The uniform field relations (43) are in one-to-one correspondence with 
subspaces V cT, The corresponding subspace O is the annihilator of V : 

n=:[K e End(T) | A;i; = 0 Vv g V}. 

The subspaces Fla are annihilators of subspaces = Ra-^, where = Sa. 
Thus, the exact relations corresponding to subspaces fla are the sets of positive 
definite matrices L such that La = v for fixed vectors {a, v] C R^. Obviously II J 
corresponds to the same class of exact relations, where L is replaced with L^. 

The only exact relation here where we do need to compute something is 0^. 
The following theorems were proved in [21] to facilitate our task. 

Theorem 11. Fix n e Let M be such that K(r'{n) — M)K G II for all JK" G O. 
Then the invertible transformation Wm — [S{L) — M]~^ maps M into FI. 

Recall that in our example of the 2D Hall effect, we found it easier not to 
work with subspaces FI and A directly but rather with subspaces fl = XFIF and 
A = Accordingly, we fix n and define F = Y~^r'(n)X~^. 

Theorem 12. Let M be such that 

K(T-M)K eU (44) 

for all K e Yl. Then the invertible transformation Wm = [Y~^S{L)X~^ — M]~^ 
maps M into FI. 

Applying this theorem to the case at hand with X and Y defined by (31), we 
obtain the inversion formula that we will use: 

L = ao + ocS- al/hl + KM]~^ (45) 

where ao and a are related to the reference medium Lo via (28). The utility of (45) 
is in the fact that it allows us to compute the manifold 

M = {[/ + A-M]-'a: \K eU] 

using simplified objects M and FI. The actual exact relation is just an affine image 
of M. 

In our example, F = a (g) a, where u = For II = Sym(R^) 

we easily see that A/ = 0 satisfies (44). So the exact relation M says that if we mix 
materials with the same Hall coefficient ro then the mixture will have the same Hall 
coefficient ro- 

Now let us compute the exact relation corresponding to Fl^. Here M = 0 does 
not work, so we choose M = F = a (g) a. Observe that the choice M = F always 
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satisfies the conditions of Theorem 12. In our simple example of the 2D Hall effect, 
the choice M = T is still simple enough for practical purposes. In other contexts 
such as elasticity, thermoelectricity, or other coupled problems the matrix T is not so 
simple, and other choices for M work (see [21] for such formidable examples as 3D 
thermopiezoelectricity). Observe that the subspace 11^ is a hyperplane defined by the 
equation Tr(/ir Q) = 0, where 




Solving (45) for K, we obtain 

A" = [(P - L')-* - « ® k]~\ (46) 

where V = <Tq P = / + a'S and a' = adetag Since the answer 
does not depend on the choice of the vector ii, we can set ii = (1 , 0) and simplify the 
equation Tr(iSi 2) = 0 with Maple. The result is written most conveniently in terms 
of the conductivity tensor a and Hall coefficient r, so that L = a + r5: 

det or 4- (r — Of — ^Sy^detao)^ = (1 + det gq. (47) 

The exact relation we obtain can be written most concisely as 

M = {(<r, r) I deta + (r ~ ro)^ = const}. (48) 

This relation was first derived by Milton [37] (see also [14]). When ^ 0 the exact 

relation (47) still retains the form (48). If ^5 oo the exact relation (47) becomes 
r = ro, first derived by Stroud and Bergman [47]. We note that the intersection of 
exact relations (48) and r = 0 results in the well-known Keller-Dykhne-Mendelson 
exact relation [13, 26, 36]. 

8.7,4. Exact relations with volume fractions 

Very often exact relations are supplemented by other relations involving volume av- 
erages. For example. Hill’s exact relation for elasticity mentioned in the Introduction 
has an extra part: 

(3/f* + = {^Kix) + 4/i)-'), (49) 

where /x is the common shear modulus, /r(jr) and k* are the local and effective bulk 
moduli, respectively. In order to obtain these additional relations, we need to compute 
the derived Jordan ideals for each subspace II satisfying (27). 

Definition 3. The derived Jordan ideal of the solution U of (27) is the subspace 
= Span{Ai *4 K 2 \ {Ai, A 2 ) C O, A e A], 
where the Jordan multiplication is defined in (14). 

Theorem 13. Let be the derived Jordan ideal ofU and suppose fl^ ^ H. LetJJ 
be the orthogonal complement ofYl^ in II. Then we have 



( 50 ) 
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where Vj\f denotes the orthogonal projection onto M and M satisfies the conditions 
of Theorem 11. 

We actually computed the derived Jordan ideals for each subspace n when we 
were computing all the solutions of equation (27). The reader may go back and verify 
that for each solution FI for the 2D Hall effect we have fl^ = fl. The simplest context 
where nontrivial derived Jordan ideals appear is 2D elasticity, which is a bit more 
involved and therefore less suitable for the purposes of the present review than the 
2D Hall effect. We refer the reader to [21], where there are plenty of exact relations 
with nontrivial derived Jordan ideals and corresponding volume average relations. 

8.7.5. Links between uncoupled problems 

Now we can ask the following question: Is there a link between conducting and 
thermal properties of a composite? More generally, is there a link between L* and 
F* — the effective tensor for This question has also been investigated in 

[21], where / is some nonlinear map between End (7]) and End (72). The answer 
uses the concept of the Jordan ideal. 

Definition 4. A subspace /C of a solution 11 of (27) is called a Jordan ideal in H if 
K\ K 2 ^ IC for all A e A, slW K\ e JC, and all K 2 e H. 

For example, the derived Jordan ideal is a Jordan ideal. As is customary in 
algebra, an ideal is good for factoring over it. Let 7^ = FI //C be the factor space in 
the sense of vector spaces. Then !F has a set of well-defined Jordan multiplications: 

TTl Kl K 2 , 

for all A e ^ and all [^i, X 2 } C H, where K e T denotes the equivalence class 
ofK. 

In order to treat the links between uncoupled problems we need to discuss 
coupled problems, at least in passing. As we have argued in Section 8.3, each physical 
context gives rise to a vector space T where the pair of relevant physical fields takes 
its values. If we have two problems with spaces T\ and T 2 then the vector space T 
corresponding to a coupled problem is T = Ti 0 72. Exact relations for the coupled 
problem would correspond to subspaces of End(71 0 T 2 ). 

Definition 5. The links between uncoupled problems are those exact relations for 
the coupled problem whose subspaces FI lie in the “block-diagonal” part End (7]) 0 
End(T 2 ) of End(Ti 0 7^). 

The technical difficulty is that in general A Ai 0 A 2 . In [21] we were able 
to avoid this difficulty by focusing on polycrystals, where we were able to replace the 
subspace A by a single matrix f. Nevertheless, we can still state the general theorem. 
In what follows, we use the notation [A, B] for 




in order to save space. 
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Theorem 14. Let n C End(Ti) 0 End(72) be an exact relation. Define 

Oi = {K\ e End(Ti) : [K\, K 2 ] e Tl for some K 2 e End(7^)}, 

YI 2 = {K 2 e End(T^) : [A'l, K 2 ] e U for some K\ e End(Ti)}, 

/Cl = [Ki e End(Ti) : [^ 1 , 0] eU], ^ ’ 

JC 2 = {K 2 e End(T 2 ) : [0, K 2 ] e O}. 

Then the subspaces Uj ofEnd(Tj), j = 1,2 are Jordan algebras in the sense of 
equations (27). The subspaces JCj C Uj, j = 1,2 are Jordan ideals. There is a 
natural linear isomorphism ^ \ !F\ = Y\\/Ki\ ^ ^ 2/^1 = ^2 defined by the rule 
1^ = ^(K\), whenever [K\, K 2 ] G 0. The map 4> is well defined and satisfies an 
important additional condition: 

^(Tx *A2 wo (52) 

for every {K\, K\] C Hi and every [A\, A 2 ] e A. 

Please note that if [Ai , A 2 ] e A then, in general, A 2 is not determined by Ai 
uniquely nor are Ai and A 2 independent. Therefore, the linear isomorphism 4> with 
the property (52) is a Jordan algebra homomorphism. 

For the specific example of the 2D Hall effect, we simply used the brute-force 
Maple computation to figure out the subspace A. Let Lq = [Li , L 2 ] be the reference 
medium through which the link M passes. As before it will be convenient to work 
not with the subspace A directly but with the subspace A = Y~^AX~^, where 
X = and F = [Li, L 2 ]X. Then A is described in terms of Ao = 

{A e Sym(R^) 1 Tr(A) = 0}, depending on what L\ and L 2 are. 

Theorem 15. Let a j, j = I, 2 be the symmetric parts ofLj. If there is a scalar .y > 0 
such that a 2 = , then A = {[A, A] : A 6 Ao}. Otherwise, A = Ao 0 Ao- 

Now that we know the subspace A, we can apply Theorem 14. First, consider 

the case when cr \ and a 2 are not multiples of one another. Then we can set A 2 = 0 in 
(52) and, recalling that 4> is a bijection, conclude that for every [K\, K\] C Hi and 
every Ai € Ao we have K\ = 0. The meaning of that last relation is that the 

ideal JC\ contains the derived ideal fl But we have already verified that there are 
no nontrivial derived ideals in the context of the 2D Hall effect. In this case, we have 
to conclude that IC\ = Hi resulting in the trivial link: [L\(x), L 2 (x)]* = [L*, L^] 
with no relation between L* and 

Now let us assume that <T 2 = 5'ori. Here we will get some interesting results. 
We begin by finding all pairs of nested exact relations from the list in Theorem 9 (to 
which the space End(R^) is added) and checking if the smaller subspace in each pair 
is a proper Jordan ideal of the larger. The checking is routine and may be automated 
with Maple even for problems of large size. We find that none of the subspaces from 
Theorem 9 is a Jordan ideal in any of the other subspaces, including End(R^). Thus, 
we must necessarily have that IC\ and IC 2 are zero subspaces and that 4> is a Jordan 
isomorphism 4> : Hi ri 2 

<t>(Ki^A K2) = Wi)^aW2) 



(53) 
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for all A e Aq. 

We will now describe how to find all the Jordan isomorphisms 0 between the 
subspaces fli and II 2 satisfying (27). The method here is based on a nice relation 
between 2x2 matrix algebra and complex arithmetic. To a complex number z = a-\-ib 
we associate a vector 7t(z) = {a,b) and two 2x2 matrices: 



Hz) = [l 



-b 

a 



and \lr{z) = 



a b 
b —a 



In the remaining part of this section, the bold lowercase letter will denote a 2D vector 
corresponding to a complex number denoted by the same nonbold letter. In situations 
where such simplified notation is inadequate we will use notation 7r(*) defined above. 
The functions (p and xp enjoy many special properties. For example, 



(p{z)u = 7t{zu), f{z)u = n(zu). 



As a corollary we have the following multiplicative identities: 

<l>(Zl)(l>(Z2) = (piZlZl), (t>(Zl)xp(Z2) = i^(Z\Z2). 



f(Z\)(piZ2) = i^{ZlZ2). ^(Z\)if(Z2) = 0 ( 21 ^ 2 ). 

Observe that if 0 is the Jordan isomorphism between Hi and 02 then dim Hi = 
dim ri 2 . Therefore, our strategy is to go through every dimension 1 through 4 and 
determine all Jordan isomorphisms between subspaces O from Theorem 9 of that 
dimension. 

Dimension 1. All ID solutions of (27) have the form Yla,b- 

Proposition 1. The linear mapping ^ : Ua^b defined by <I>(a(S)i) = c(S>d, 

is a Jordan isomorphism between Y\a,b (^^d Y\c,d if and only ifab = cd. 

Proof. For every A G ^0 we have 

{a 0 h)A{a 0 ^) = {Aa • b)a 0 b. 

Applying the mapping <J> to this relation and using the property (53), we obtain 
(c 0 d)A{c 0 J) = (Aa • b)c 0 d. 

Thus, we must have 



Aa b = Ac ' d (54) 

for every A g Aq. Now observe that Ao = {0(z) | z G C}. Therefore, equation (54) 
becomes ?H(z(ab — cd)) = 0 for every z G C. Thus, O is a Jordan isomorphism if 
and only if ab = cd. □ 

Dimension 2. There are three classes of 2D solutions of (27). We denoted them 
Ao, Ha, and Uj. 

Proposition 2. There are no Jordan isomorphisms between Ao cind Oa and between 
Ao cmd n J. The only Jordan isomorphism between Ao and Ao is the identity mapping. 
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Proof. Suppose <I> is the Jordan isomorphism between and Ha. Then there exists 
a nonsingular matrix V such that <I>(V^(m)) = Vm 0 a for all m e C. Applying the 
Jordan isomorphism ^ to 

= ylr{u^z), (55) 

we obtain 

(Vii 0 a)xlr(z){Vu 0 a) = V7t{u^z) ® a. 

Therefore, (V^(z)a*Vii)M = which implies that wz must be real for all {w, z} C C. 

This is impossible, and therefore the Jordan isomorphism <J> between Ao and flfl does 
not exist. The Jordan isomorphism between Ao and 11 J is also impossible. Indeed the 
mapping K is the Jordan isomorphism between fla and 11 J. If there was a 

Jordan isomorphism between Ao and 11 J then there would be a Jordan isomorphism 
between Ao and 11^, which we proved does not exist. 

Now let <I> be the Jordan isomorphism from Ao to itself. Then there exists a real 
linear invertible mapping : C -> C such that ^(V^(w)) = V^(i?(w)). Applying O to 
(55) we get (i^(u))^z = t^(u^z). The linear map has the form i^(w) = 0\u -{■ O 2 U 
for some complex numbers 0\ and 62 - Equating the coefficients at z and z in 

(Oiu + 02u)^z = 0\uh + 02U^Z, 

we obtain that $2 = 0 and ^1 = 1. Thus, <I)(V^(m)) = V^(w) for all u eC. □ 

Proposition 3. The unique Jordan isomorphism ^ between Yla and 0* is given by 

^(v 0 a) = (l>(alb)v 0 b. (56) 

The unique Jordan isomorphism 4> between Fla and is given by 

0 a) = b®(j>{a/b)v. (57) 

The unique Jordan isomorphism <I> between FI J and FI is given by 

<I>(a 0 v) = i 0 <\>(alb')v. (58) 

Proof For a linear isomorphism between Fla and Fl^, there exists a nonsingular 
matrix V such that <|)(i; 0 a) = Vv 0 A. Applying ^ to 

(u 0 a)\lf(z)(v 0 a) = (\lr{z)a • u)v 0 a, 

we obtain 



(Vv 0 b)\jr{z){y V 0 fc) = (ir(z)a • v)Vv 0 b. 

Therefore, for every z G C, we have V^\l/(z)b = ’^{z)a. The map V as a map on C 
can be written as 

7r~^(ya) = v\u -h V 2 U 

for some complex numbers v\ and V 2 - Therefore, we have 

v\zb + V 2 Zb = zd. 
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Equating coefficients at z and z we obtain that 1^2 = 0 and v\ = ajb. Thus, (56) gives 
the unique Jordan isomorphism between Tla and 

Now, if 4> is a Jordan isomorphism between Tla and TlJ then r o is the Jordan 
isomorphism between Tla and fl*, where r(^) = . Hence there is a unique 

Jordan isomorphism between and Oj given by (57). Similarly, if <I> is a Jordan 
isomorphism between FI J and Uj , then r o O o r is a Jordan isomorphism between fla 



Dimension 3. The 3D solutions of (27) are the subspaces from Theorem 9. 



Proposition 4 . For any pair , )02} C M U {00} there are Wo Jordan isomorphisms 

between and Tl^ given by 

^{K) = <D(A-) = (59) 



where a is the unique solution in [0, tt) o/ 



tan (a) = 



1 + PlPl 



Proof, Suppose 4> is the Jordan isomorphism between fl^j and Tl^^, Observe that 
Aq C for any /S. We conclude that 4>(>to) = since JIq is not isomorphic to any 

other Jordan algebra but itself. Moreover, the restriction of 0 to must be the identity 

map. Observe that any matrix K e T\^^ can be written as AT = jc0(e*^) + ^(m), 
where ^\ = — cot y \ . Any linear map 4> from Tl^^ into 11^2 ^^at is the identity on Aq 
has the form 



<I>(A:</)(e'^^) + V^(w)) = ax(j>{e^^) + \lr(u -\-xp) 

for some a e M, and p e C. Here ^2 = — cot 72- Applying the Jordan isomorphism 
^ to 

ix4>{e’^') + \lf(u))\lf{z)(x^ie''>'') + \ir(«)) = 2xm(zu)<p(e'^') + \lr(x'^z + u^z), 

we obtain 



(ax<p(e'^2) 4 - ir(u -i- xp))r/r(z)(ax<p(e^^^) -I- }/r(u xp)) 

= 2xa^H(zu)^(^^^) + 4- u^z 4- 2x^H(zu)p). 

Since any 2x2 matrix can be uniquely written as 0(zi) 4- we obtain, by 
equating arguments of 0, 



di(zu) = ^(zu-hxzp). 

It follows, therefore, that p = 0. Equating arguments of we get 

x^Z 4- u^z = a^x^z 4- u^z. 

It follows that either a = lora = — 1, corresponding to the two Jordan isomorph- 
isms given by (59). □ 
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Dimension 4. 

Theorem 16. If ^ is a Jordan isomorphism of End(R^), then either ^{K) = 
(j){e^^)K^{e^^) or = <f){e^^)K^ (!>{e^^). Moreover, all Jordan isomorphisms 

between Hi and Yli — solutions of (21) — are restrictions of the Jordan isomorphisms 
o/End(R^) to rii. 

Before proving the theorem we remark that all Jordan isomorphisms O of 
End(R^) have the property 

(t>{KxAK2) = <D(^:i)A<D(^2), 

for all A E Ao- Therefore, all the 7 -chain properties are obviously satisfied. Thus, all 
the links that we find are stable under homogenization. 

Proof Let 0 be the Jordan isomorphism of End(R^) . If we restrict 4> to a 3D subspace 
{x(j){e^^) + I jc E R, u eC] then we have, according to Proposition 4, 

^{x(f){e^^) + ylr{u)) = x(j){e^^e^'^) + \lr(u). 

Therefore, for every y e [0, tt), every x e R and u e C 

^(xcpie^'^) 4- V^(w)) = x(j){e^^^^^e^'^) + 

But <I> is linear and therefore there are complex numbers p\ and p 2 such that 0 (0 (z)) = 
0(piZ + P 2 ^- Consequently, = p Thus, for every y E [0, tt) 

we have \p\ + = 1- This equation says geometrically that the circle cen- 

tered at p\ with radius IP 2 I is a subset of the unit circle. There are only two possi- 
bilities: Pi = 0, IP 2 I = 1 or p 2 = 0, Ipil = 1. Thus, either O(0(u) -f 0(w)) = 
(j>{e^^v) + 0(m) or ^(0(u) -h 0(w)) = <t>(e^^v) + 0(m). In the matrix notation, either 
<I>(/ir) = or <I>(A') = and the first part of 

the theorem is proved. 

We have already described all the Jordan isomorphisms between solutions of 
(27). It is now a simple matter to check that each one of the Jordan isomorphisms 
described in Propositions 1-4 is indeed a restriction of one of the Jordan isomorphisms 
of End (R^). n 

We remark that the group of all Jordan isomorphisms of End (R^) is isomorphic 
to 0(2). Another remark is that Theorem 16 saves us a lot of work. All we have 
to do is to compute the links between uncoupled problems for End(R^). Any other 
link is just a restriction of the global link to one of the exact relations computed 
before. We also observe that the Jordan isomorphism r(K) = corresponds to 
the link (L(x)^)* = (L*)^, which is a theorem of Murat and Tartar [40] that says 
that if L^(x) H-converges to L*(x) then {L^{x)Y H-converges to {L*{x))^ . If we 
combine this link with the links corresponding to 4>(Jf) = <l)(e^^)K<j)(e^^) we will 
obtain the links corresponding to ^(K) = ()){e^^)K^ <j){e^^). All we have to do now 
is to apply the method of Section 8.7.3 to the subspace 

n = [[K,(j){e^^)K<t){e^^)] : K e End(R^)}. 




8. Exact Relations 



193 



\ j'y I/O 

According to the formula (46) we define , P\ = I -\-ot\S 

and L '2 = Pi = I otiS, Then the link between L\ and Li 

has the form 

j^(P2 - ~ ® “] ^ - L\)~^ -u®u^ 0(e‘“). (60) 

The relation between L\ and L 2 does not depend on the choice of the unit vector u. 
Indeed, we can rewrite (60) as 

(P 2 - = <p{e~‘“)(Pi - L\y^4>(e~“‘) + u®u- <p(e~‘“)u ® <j>{e'“)u. 

Now we use our complex calculus to show that u 0 u — (f>(e~^^)u 0 (p(e^^)u does 
not depend on the choice of the unit vector u. Indeed, a rank-1 matrix a® b can be 
written as ((l>(ab) + \l/(ab))l2. Thus, we have 

M 0 II — <l>(e~^^)u 0 <j){e'^)u = (^(1) + f(u^))/2 

— (0(^“*“m^*“m) -f ^(^“"*“Me*"w))/2, 



and therefore 

L '2 = P 2 - [0(e-'“)(Pi - L'i)-V(e"'“) + sin(a)^(ie-'“)]"‘ . (61) 

We examine the right-hand side of (61) with Maple and after a lengthy investigation 
obtain the link in terms of the conductivity tensors a and the Hall coefficient r: 



^^(ro -ri)2 + det<ri’ 



_ rp-n 

^^(ro -ri)^-f detai 



+ ^o> 



(62) 



where cp > 0, rp and qp are constants. This link was first obtained by Dykhne [12] 
in a particular case of an isotropic composite made with two isotropic phases. In its 
present form the link was first derived by Milton [37]. The links corresponding to the 
the remaining Jordan isomorphisms can be obtained by combining (62) with the link 
L 2 = L^. In other words, the other set of links has the form (62) but with r\ replaced 
by -n: 



^^(ro + ri)2 -bdetai 



_ rp + r\ 

(ro + ri)2 -fdet or 1 






(63) 



Following Milton [37], we observe that the trivial exact relation “ri(Ar) = 0 implies 
r* = 0” transforms into the relation (48) according to the link (62) (or (63)). 

Restricting the links to the case of conductivity we obtain the well-known re- 
sult [36, 39] 



(a(x)/deta(jr))* = a*/ deter*. 



It is possible to play further with the links and exact relations, applying them to the 
case of two-phase composites, polycrystals, and such. We will not pursue this and 
stop here. 
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8.8. Conclusions 



Our theory of exact relations permitted us to derive the following microstructure 
independent results for effective properties of composites in the context of the 2D 
Hall effect. All of them were previously known (see [12, 14, 37, 47]). In a weak 
magnetic field the local conducting properties of a 2D composite are described by the 
nonsymmetric positive definite 2x2 matrix field L{x) = < t { x ) -f r{x)S. We have 
obtained the following complete list of results. 

1. If L(x)a = b for d\\x e Q then L*a = b. 

2. If r(x) = ro then r* = tq. 

3. If L* is the effective tensor for L{x) then (L*)^ is the effective tensor for 
L{x)^. 

4. If (r(Ar) — ro)^ + detff(A:) = co forall x € Q then (r* — ro)^+det(r* = cq. 

5. If L* = a* + r*S is the effective tensor Jfor L{x) = cr(jc) -f r(x)S then 
L* = a* +7*5 is the effective tensor for L{x) = a(jc) + n(jr)5, where 



a(x) = CO 





(ro-r(j«:))2 + deta(jr)’ 



T{x) = CO 



rp - r(x) 

(ro-r(x))'^-i-dQia(x) 






and 



cr = CO 



(ro -- r*)2 + deta* ’ 



r = co- 



rp-r^ 



(ro — r*)2 + det(7* 



+ <?o* 



Technically speaking, there are other results, but all of them are consequences of the 
ones listed above. Our theory also guarantees that there are no other microstructure- 
independent equalities. 

Throughout this paper we have described several important open questions in 
the theory. We summarize them here. 

• The polycrystalline G-closures possess an important convexity property that 
was the cornerstone of the present paper. Can it be used to obtain a finite 
geometric algorithm for constructing G-closures? Such a construction was 
found by Francfort and Milton [15] for 2D conducting polycrystals, sug- 
gesting a positive answer to our question. For more detailed discussion see 
Section 8.5. 

• Equations (27) have to be solved in each physical setting to determine all 
the exact relations there. So far we have been able to solve these equations 
for problems of modest size, where the brute-force approach works. Yet, 
equations (27) say that the subspace 11 has special structure of the Jordan 
algebra. Can this structure be used to help solve the equation? In fact, our 
result [21, Theorem 5.2] suggests that the question is meaningful. 

• Finally, there is the question of whether the sufficient conditions for a mani- 
fold to be an exact relation are also necessary. See the more detailed formu- 
lation of this question at the end of Section 8.6. 
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9, A Multifield Theory for the Modeling 
of the Macroscopic Behavior of Shape 
Memory Materials 

Davide Bemardini* and Thomas J. Pence^ 



Abstract. The macroscopic behavior of shape memory materials is modeled 
within the framework of multifield theories. Two scalar fields and a second-order 
tensor field are used as descriptors of the relevant microstructural phenomena. In 
this way it is possible to allow for pseudoelasticity and shape memory effect, as 
well as low and high temperature reorientation of Martensitic variants. The gen- 
eral aspects of the theory are discussed paying special attention to the treatment 
of balance equations and to the exploitation of the constitutive structure, which is 
characterized by the prescription of a response function for the entropy production. 
An example of an explicit model is also given. 



9.1. Introduction and literature survey 

The term shape memory material (SMM) describes a class of materials that exhibit, 
at the macroscopic scale, the properties of pseudoelasticity and shape memory (see, 
e.g., Otsuka and Shimizu, 1986, Otsuka and Wayman, 1998). These behaviors arise 
in materials that transform between different solid phases in a manner that does 
not require atomic diffusion. These transformations can be induced by mechanical, 
thermal, or even magnetic, energy supply (Fischer et al., 1994). Among the several 
examples of materials belonging to this class, binary and ternary metallic alloys 
like NiTi and CuZnAl are widely used in several fields of engineering, industry 
and medicine (Melton, 1999). Pseudoelasticity refers to the ability of small changes 
in stress to generate large, but bounded, changes in strain. Unloading may or may 
not reverse the deformation and recover the original shape, the determination being 
dependent on the material temperature. The original shape is recovered on unloading at 
high temperature, with the reverse deformation taking place at lower stress levels than 
those associated with the original deformation. The original shape is not recovered at 
low temperature meaning that a residual deformation remains after unloading. This 
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residual deformation can be annihilated by heating, thus returning the material to its 
original shape and yielding the so-called shape memory effect. 

These effects arise because the high temperature phase, called Austenite and 
abbreviated in this section by A, has a higher crystallographic synunetry than the 
low temperature phase, called Martensite and abbreviated in this section by M. The 
Austenite can therefore transform into one or more variants of Martensite that differ 
mainly by their orientation relation to the Austenite parent. By contrast, all Martensite 
variants tend to transform into a single common Austenite crystal structure. The dif- 
ferent crystallographic structures of Austenite and the Martensite variants account for 
the large deformations at the macroscopic scale. Transformation from A to M gives 
rise to variant combinations that are energetically favored in the prevailing stress 
field, thus providing the high temperature pseudoelasticity. Transformations between 
different M variant structures are also driven by changes in the stress field, thus pro- 
viding a low temperature pseudoelastic effect that is called reorientation. There is 
hysteresis in transformation between A and M and in M variant reorientation. This 
hysteresis accounts for the absence of shape recovery upon unloading at low temper- 
ature and also accounts for the lower stress values associated with shape recovery at 
high temperature. 

In this article we present a framework for describing these materials in the con- 
text of multifield theory. In addition to the usual field variables of position x and 
temperature the existence of both A and M phases motivates additional scalar field 
variables of Austenite phase fraction and Martensite phase fraction In addition, 

different M microstructures must be acknowledged due to the variety of variant com- 
binations within individual crystallite grains. In polycrystals^ the grain texture then 
gives rise at the macroscopic scale to what may be regarded as a continuous distribu- 
tion of different varieties of Martensite. These are distinguished here by an additional 
second-order tensor field M that is characteristic of the effective macroscopic trans- 
formation strain of the Martensite. The additional fields ^a/, M are subject to 
balance principles with the same status as the standard balances of momentum and 
entropy and participate in the energy balance with the same status as the usual fields. 
This framework permits a unified description of shape memory, pseudoelasticity, re- 
orientation, and hysteresis as appropriate for a macroscopic model. The framework 
developed here is limited to quasistatic processes and so is not appropriate for is- 
sues of wave propagation and inertial dynamics. Fully dynamic processes could be 
described by extensions to the present framework. Moreover, the present framework 
is also limited to smooth processes, although again the associated extensions can be 
contemplated so as to treat processes with various discontinuities in field variables. 

Continuum descriptions for SMM materials are now abundant, although the 
particular multifield framework presented here is apparently new. The variety of 
continuum models for SMM are reviewed in various survey papers (e.g., Huo and 
Muller, 1993; Fischer et al., 1996; Bemardini and Pence, 2002a). For the present 
discussion we restrict attention to macroscopic constitutive theories for SMM that 
either employ additional field variables similar to ^a and f a/ or else employ some 
notion of macroscopic transformation strain similar to M. In the latter case this is 
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typically on the basis of mechanical analysis of variant combinations at finer length 
scales. 

Several models have been posed in the framework of continuum thermody- 
namics with internal variables (Coleman and Gurtin, 1967; Rice, 1971) where the 
internal variables are quantities introduced in order to take into account some of the 
microscopic scale phenomena and which are considered as arguments of the stan- 
dard response functions. They are constrained by ordinary differential constitutive 
equations which are, usually, explicit in their rates. In the context of SMM, internal 
variables typically include one or more phase fractions or macroscopic transformation 
strains. The evolution of the phase transformations may then be governed by ordinary 
differential equations in the internal variables rates, the phase transformation kinetics, 
which are prescribed* constitutively at the outset. 

Early application of the internal variable formalism to SMM focused on account- 
ing for uniaxial isothermal pseudoelasticity through the use a single scalar internal 
variable equivalent to (Tanaka et al., 1986; Liang and Rogers, 1990). This inter- 
nal variable then evolves in response to changes in temperature and to changes in 
a single component of stress. This evolution in these early treatments is governed 
by a separate phase transformation kinetic. The kinetic is designed so as to initiate 
and conclude the A to M and M to A transformations in appropriate regions of a 
temperature-stress phase diagram. A limitation of this approach is that it is only able 
to model a Martensite with fixed transformation strain, and thus unable to account 
for both positive and negative stress and also unable to model the complete shape 
memory effect. Even so, the inherent simplicity of this type of model has led to its 
wide use for several applications exploiting the pseudoelastic effect under positive 
uniaxial isothermal loading. By incorporating a second scalar internal variable to al- 
low for the distinction between stress-induced (or oriented) Martensite and thermally 
induced (or unbiased) Martensite, these models can be extended so as to allow for the 
shape memory effect in the context of uniaxial stress states and isothermal conditions 
for mechanical loading (Brinson, 1993). 

The models mentioned above directly pose stress-strain-temperature relations 
with little attention to an overarching energetic structure. Such structure can be pro- 
vided in the context of uniaxial pseudoelasticity via minimization of a macroscopic 
Helmholtz free energy function with dependence on the Martensite fraction as in 
Muller (1989). The resulting model was among the first to provide explanation for 
the internal subloops observed under incomplete phase transformations. The approach 
can in turn be extended to three-dimensional stress states by providing richer expres- 
sions for the free energy and formally positing a kinetic for the Martensite fraction 
with further refinement provided by decomposition of into stress-induced and 
thermally induced portions (Raniecki et al., 1992; Raniecki and Lexcellent, 1994). 
Other developments involve the use of Martensite fractions subject to constitutive 
assumptions that allow also for reorientation of the Martensite variants (Boyd and 
Lagoudas, 1996). 

Many of the above models focus on isothermal behavior and also on loading 
paths that involve complete phase transitions. A description of pseudoelastic behavior 
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under arbitrary uniaxial thermomechanical loads can be given through the use of a 
transformation kinetic for the phase fraction that is based on Duhem-type differential 
equations (Ivshin and Pence, 1994a, 1994b). This framework allows for a systematic 
modeling of internal subloops and their shakedown behavior, as well as the tem- 
perature change induced during mechanical loading in adiabatic conditions and in 
conditions involving convective heat transfer. Extensions that account for true shape 
memory, low temperature reorientation, and tension/compression asymmetry can be 
accomodated by extending the framework so as to decompose the Martensite phase 
fraction into two separate fractions representative of different Martensite variants of 
opposite orientation (Wu and Pence, 1998). 

A standard feature of the above mentioned internal variable models is a transfor- 
mation kinetic for the phase fraction (either or ^m) that is specified at the outset as a 
direct constitutive assignment. A different approach involves a transformation kinetic 
that is instead derived from a dissipation response function for the rate of entropy pro- 
duction, after enforcing restrictions induced by the balance equations (Rajagopal and 
Srinivasa, 1999).^ Consequently, the constitutive prescription reduces to the specifi- 
cation of a free energy function and a dissipation function. While there are standard 
forms for free energy in the context of uniaxial stress, standard expressions for the 
dissipation function are lacking. Alternative forms for the dissipation function can 
capture different hysteretic effects and provide correlation to various thermodynamic 
process quantities (Bemardini and Pence, 2002b). 

Less developed than the notion of a transformation kinetic for a phase fraction 
scalar or a collection of phase fraction scalars is the notion of a transformation kinetic 
for a tensor transformation strain internal variable such as M. A notable exception in 
the context of infinitesimal strains is the model of Bondaryev and Wayman (1988), 
which is also remarkable for its relatively early appearance. Their model provides an 
account, in a three-dimensional setting, of both pseudoelasticity and reorientation. It 
is based on the consideration of phase transformation strain as an internal variable so 
that phase fractions then arise as multipliers of the corresponding flow rule. Further 
developments in this direction have been given, by a suitably generalized plasticity 
theory, in Lubliner and Auricchio (1996) and their subsequent works where also finite 
element implementation is given. 

With regard to the notion of describing a macroscopic tensor transformation 
strain, much effort to date has focused on its determination on the basis of the current 
local stress state using an effective continuum approach. Within this framework, each 
point of the macroscopic continuum is put in correspondence with a representative 
volume element (RVE) of a multiphase elastic material subjected to local transforma- 
tion strains that represent the effect of the difference in the crystallographic structure 
of the phases. At the single crystal level, an Austenite matrix can support a specific 
finite number of Martensite variants, the specific number based upon the reduction 
of symmetry in the A to M transformation. Each M variant then has a characteristic 



^ A further feature of the model is a more systematic conception of transformation strain that is acheived 
by allowing the material to have multiple natural configurations. 
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transformation strain with respect to a common A parent phase. This by itself does 
not allow a direct description of macroscopic transformation strain in a polycrystal, 
nor does it address certain issues related to crystallographic phase interfaces.^ 

Macroscopic descriptions of shape memory material behavior can be acheived 
by considering averaging procedures upon assemblies of grains in an RVE where each 
grain is endowed with an appropriate microstructure (Patoor et al., 1988; Siredey et 
al., 1999). Numerous types of microstructural arrangements and averaging procedures 
can be contemplated in this framework. These include RVEs consisting of nontex- 
tured assemblies of spherical grains with averaging based upon self-consistency. The 
effect of specific texture can then be simulated using experimentally determined grain 
orientation distribution functions (Gall and Sehitoglu, 1999). At the level of a single 
crystal, an A matrix with M inclusions made of groups of self-accomodating variants 
can be arranged so as to resemble experimentally observed wedge-like microstruc- 
tures prior to averaging (Gao and Brinson, 2000; Gao et al., 2000). Alternatively, 
averaging can be performed on polycrystals with an RVE consisting of a collection of 
grains, each wholly in the A phase or wholly consisting of a single M variant. Although 
the local transformation strain of each grain is then not strictly crystallographic, the 
resulting description is suggestive of SMM behavior both in the high and in the low 
temperaturature ranges (Sun and Hwang, 1994). 

Dissipation can be included in such modeling by additional averaging over an 
internal time scale that is representative of the transformation duration in order to 
obtain an average dissipation rate and driving force. The phase transformations may 
then be governed, macroscopically, by overall nucleation and propagation energetic 
criteria that are obtained after integration of the corresponding local criteria over the 
RVE and over the propagating interfaces. The final evolution equations then follow 
by invoking an extremum principle, closely related with a maximum dissipation 
requirement (Levitas, 1998). 

Different elastic properties of the A and M phases can be accommodated in these 
various averaging techniques, which in turn generates differences in the structure of 
the macroscopic free energy. In particular, free energies for the above referenced 
multidimensional treatments can follow from volume averaging over boundary value 
problems of heterogeneous elastic materials with prescribed eigenstrains (Bernardini, 

2001) . Macroscopic models then follow when such treatments are augmented with a 
suitable phase transformation kinetic. 

In this work the macroscopic thermomechanical behavior of SMM is modeled 
within the framework of multifield theories (Capriz, 1989; Capriz, 2000; Mariano, 
2001 and references therein) using microstructural descriptors consisting of both 
scalar and second-order tensor fields. This approach does not seem to have been pre- 
viously pursued with reference to SMM. A specific feature of this approach is that 

^Although not germane for the purpose of this article, it is worth noting that individual phase interfaces 
are often modeled with the aid of nonconvex or multi well stored energy functions. Analysis of individual 
phase domains with distinguishable phase interfaces can then occur in both static situations (e.g., Ericksen, 
1975; Falk, 1980; Ball and James, 1987; Bhattacharya and Kohn, 1997; Truskinovsky and Zanzotto, 

2002) and dynamic situations (e.g., Abeyaratne and Knowles, 1991 ; Pence, 1992; Rosakis and Tsai, 1995; 
Truskinovsky, 1997). 
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the descriptors are regarded as truly observable quantities which therefore directly 
contribute to the balance equations. Each microstructural descriptor is governed by 
its own balance which is required to be consistent with the observer invariance prin- 
ciple. The framework also naturally allows for spatial weak nonlocality by virtue of 
a possible dependence on spatial gradients of the microstructural descriptors. This 
allows also for refined descriptions of microstructural localization, which is an im- 
portant issue in modeling SMM. Some connection with certain aspects of the present 
work can be found in Fried and Gurtin (1994) and in Fremond (1996) where balance 
equations are also proposed for microstructural descriptors that, in their cases, are re- 
stricted to be scalar in nature. Here, in addition to the consideration of a second-order 
tensor order parameter and certain other basic differences with respect to the general 
development, we also explore additional issues with respect to the constitutive theory. 

Specifically, as remarked above and developed in Section 9.3, the present model 
takes as descriptors of the underlying microstructure two scalar fields repre- 

senting the volume fractions of the Austenite and Martensite phases and a second- 
order tensor field M that defines the features of the particular arrangement of Marten- 
site variants. The macroscopic displacement is described by the placement field x 
and the overall framework is required to satisfy an observer invariance principle as 
discussed in Section 9.4. In Section 9.5, scalar balance equations are presented for the 
fields and a second-order tensor balance equation is presented for the field 

M. These accompany the usual balance equations for linear momentum, entropy, and 
energy. Linear momentum balance is a vector equation that is regarded as the balance 
associated with placement x, while the entropy balance introduces the temperature 
field into the formulation. The general constitutive structure as elaborated in Sec- 
tion 9.6 involves, as customary in multifield theories, the specification of response 
functions for the volume and surface densities of the interactions and, in addition, also 
includes a constitutive function for the entropy production. The fields M are 

typically subject to various constraints that in turn give rise to reactive terms in the 
response functions. The effect of these constraints, as well as possible constraints on 
X, are considered in Section 9.7. Then in Section 9.8, restrictions on the constitutive 
functions are derived from the balance equations. It is shown that the set of response 
functions is completely determined in terms of two scalar functions: the free energy 
function and the entropy production function. 

In contrast to various other treatments for SMM, phase transformations are not 
governed by relatively ad hoc criteria but rather, as discussed in Section 9.9, are 
governed by the response functions for the free energy and entropy production via 
the microstructural balance equations. The hypothesis of rate independence for SMM 
transformation introduces natural memory parameters into the constitutive description 
for the rate of entropy production. This accounts for hysteresis with respect to the fields 
M and so in particular describes incomplete transformation effects and the 
generation of internal subloops in stress-strain response. The requirement of observer 
invariance puts additional restrictions on the constitutive theory that identify natural 
combinations of field variables for the constitutive dependence. An explicit example 
involving linearized kinematics is presented in Section 9.10, which under appropriate 
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specializations reproduces previous descriptions for pseudoelasticity and the shape 
memory effect, while at the same time, providing a treatment for reorientation of 
Martensitic variants at both low and high temperature. 

9.2. Notation 

The following conventions for tensor algebra will be used henceforth (Murdoch, 
1978). Let E represent the three-dimensional Euclidean space with translation space 
Ve and scalar product • and let a, b, c, d, e, f € Ve be six arbitrary vectors. Sim- 
ple second-, third-, md fourth-order tensors are thus defined via the dyadics in the 
standard way: 

a(g)b, a0b(g)c, a0b0c0d. 

The corresponding general tensors and all the corresponding operations are defined 
by linearity on a basis for Ve- The scalar products between tensors of second- and 
third-order are denoted by the same symbol • used for Ve and, for simple tensors, reads 

(a 0 b) • (d 0 e) = (a • d)(b • e), 

(a 0 b 0 c) • (d 0 e 0 f) = (a • d)(b • e)(c • f). 

Various types of contractions may apply; those used in this work are denoted by o, o, 

3 

o and are defined, on simple tensors, as follows: 

(a 0 b 0 c) o (d 0 e) = (c • d)(a 0 b 0 e), 

(a 0 b) o (c 0 d 0 e) = (b • c)(a 0 d 0 e), 

(a 0 b 0 c) o (d 0 e) = (c • d)(b • e)a, 

(a 0 b) o (c 0 d 0 e) = (b • c)(a • d)e, 

(a 0 b 0 c 0 d) o (e 0 f) = (d • e)(c • f)(a 0 b), 

(a 0 b 0 c) o (d 0 e 0 f) = (c • d)(b • e)(a • f). 

Moreover, two types of transposition on third-order tensors are considered 
(a 0 b 0 c)^ = (c 0 b 0 a), (a 0 b 0 c)" = (b 0 a 0 c), 

the other possible transpositions being obtainable by their composition. Transposition 
operations and T are defined on fourth-order tensors via 

(a 0 b 0 c 0 d)~ = (d 0 b 0 c 0 a), 

(a 0 b 0 c 0 d)^ = (b 0 a 0 c 0 d), 

(a 0 b 0 c 0 d)^ = (c 0 d 0 a 0 b). 

All these tensorial notions can be extended also to the case in which different 
copies of E are involved by considering combinations of vectors belonging to various 
translation spaces. For ease of notation, the distinction between the various types of 
tensors will be explicitly stressed only when strictly necessary. Specifically, this will 
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be done in the case of two-point second-order tensors which are defined relative to 
E and to its copy A, the latter with translation spaces Va- If g € Va, then both 

a (g) g and g (g) a 

furnish such a two-point tensor. Then one may interpret a g) g € Lin(VA» Ve) and 
gga G Lin(VE, Va). 



9.3. Kinematics 

The position in space of a body 03 is idealized by a bijection 

( 1 ) 

which associates with each element p endowed with a distinct identity a point of E 
indicated by x = x (p). The mapping x is said to define an apparent placement for 
05 if the set B = x(®) C E, obtained when p spans 05, is a Noll-Virga fit region 
(Noll and Virga, 1988). Material elements p are acknowledged to be endowed with an 
underlying crystallographic structure whose features can potentially affect the macro- 
scopic response of the body. In order to provide an idealization of this circumstance, 
the logical path of multifield theories^ is followed and the configuration of each p is 
characterized, besides its position in space, by further attributes, selected via physical 
considerations, as descriptors of the relevant features of the microstructure. 

There exists extensive experimental evidence of the multiphase nature of SMM 
(Otsuka and Shimizu, 1986; Otsuka and Wayman, 1998) that exhibits arrangements 
of Austenite and variants of Martensite in microstructures of variable complexity 
depending on several factors such as, for example, alloy composition, cold work, 
annealing, and loading history. In the simplest case of single crystals subjected to 
uniaxial stress states, a single correspondence variant pair of Martensitic variants is 
usually observed (Gall et al., 1999), whereas for polycrystals under three-dimensional 
stress states, the microstructure generally appears as an arrangement, within a grain 
structure, of Austenite regions and many groups of variously twinned Martensite 
variants (Fischer et al., 1994; Shield, 1995). 

For the purpose of the present modeling, material elements are understood as 
composed of an arrangement of an Austenite phase with a Martensite phase where, in 
turn, the Martensite phase is endowed by further structure reflecting the possibility of 
being composed by an assembly of variants. This kind of microstructure is introduced 
in the theory by means of two kinds of descriptors : the volume fraction of the Austenite 
and Martensite phases and a tensorial quantity that identifies the particular kind of 
Martensite under consideration. 



^The idea to endow material elements with further degrees of freedom is a long-standing one and, histor- 
ically, emerged in a variety of contexts. In this work we refer to the approach pursued in Capriz (1989), 
Capriz (2000), Mariano (2001); however, various lines of development of the idea can be found in the 
literature as, for example, in Eringen (1999) and the references therein. 
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The volume fractions of Austenite and Martensite are specified by a continuous 
mapping 

z:f8^ZcR2 

which associates with each p an ordered pair of real numbers ^m) = ^(P)« The 
set Z collects the physically admissible pairs also indicated by the bold 

letter The structure of Z is specified by equality and inequality constraints, i.e., 
z = z"‘inzi” with 

Z®'* := 6 = 0 O' = 1, . . . , Meq)}, 

Z" := {| e <00 = 1,..., Mi„)}, 

where and are suitably smooth, scalar- valued, constraint functions so as to 
render Z^” convex. Typical constraint functions that arise naturally from the definition 
of the phase fractions involve Meq = 1, Min = 4 with 

W^ = lA+^M-h ( 2 ) 

Each set 2 =^(Q3) of elements belonging to Z is said to be a phase distribution for 
03, while the mapping'? is a phase assignment for 03. 

The variant assembly within the Martensite phase is described by a continuous 
mapping 

m : 03 M c Lin(VA, Ve) (4) 

which associates with each p a two-point second-order tensor M = m(p). The set M 
of the physically admissible values of M is given by M = with 

:= {M e Lin(VA, Vg) | gf{M) = 0 (; = 1, . . . , A^eq)}, 

(5) 

M‘" := {M € Lin(VA, Ve) | gf(M) < 0 O' = 1, . . . , Mn)}, 

for suitably smooth, scalar-valued, constraint functions gj^ and gj so as to render 

convex. In (5) Va and Ve are the translation spaces of two copies, A and E, of 
the three-dimensional Euclidean space such that E is the space in which the actual 
position of ® is represented (see (1)) whereas A is the representation space for the 
position of 93 when it is in a fully Austenitic state. The tensor M may then be inter- 
preted, locally, as arising from a homogeneous microscale transplacement that would 
convert an entirely Austenitic patch into another one composed completely of the 
Martensite under consideration."^ In particular, M = I is associated with a Martensite 
that yields no macroscopic deformation (i.e., self-accommodated), whereas M I 
implies that the Martensite is associated with a collection of variants giving rise to 

^This interpretation is an attempt to introduce a purely kinematic description of the effect of the microstruc- 
ture on the macroscopic behavior. This task can be formalized rigorously by following, in a more explicit 
way, the approach developed in Del Piero and Owen (1993). 
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a net macroscopic deformation with respect to the Austenite. As regards constraints 
upon M, an example motivated by the case in which microscale transplacement is 
volume preserving and limited by crystallographic lattice shifts, is given by Aeq = 1, 
Ain = 1 with 

gJ‘’(M) = detM-l, (6) 

= (M^M - 1) • (M^M - 1) - (g*)^, (7) 

where g* is characteristic of the maximum “effective” lattice shift between the par- 
ticular Martensite under consideration and the Austenite. Each set M = m(Q5) of 
elements belonging to M is a Martensite distribution for ® , while the mapping m is 
a Martensite assignment for ® . 

The quantities M are therefore regarded as a complete representation of 
the microstructure underlying each material element. They are independent since, 
at each point, independent variation of $ and M may occur reflecting physically 
meaningful different phase transformations. Specifically, a variation of $ corresponds 
to a proper Martensitic transformation (i.e., conversion of Austenite into a fixed kind 
of Martensite or vice versa), while a variation of M indicates that the kind of Martensite 
is changed and so provides the other typical transformation observed in SMM, which 
is usually called Martensite reorientation (Bemardini and Pence, 2002a). Martensite 
reorientation provides the only transformational change when the material is in its 
low temperature range. 

Summarizing, the assignment of a configuration of the body is defined by the 
action of the mappings m which assign to every p g 05 a triple 

C(p) := {x(p). %). n»(P)) e E X Z X M. 

Clearly, the configuration of the whole 05 is the triple of subsets {S, Z, M], respec- 
tively, of E, Z, and M. Such a configuration for 05 is said to be admissible if S is a fit 
region. 

Given any two apparent placements B\ and B 2 , the apparent transplacement 
from B\ to B 2 is the bijective mapping X • ^2 such hat 

X(Sl) = S2. (8) 

Therefore, taking any apparent placement TZ as reference, a material point labeling can 
be established so as to yield mappings x > from Tl to any admissible configuration 

B, Z, M in such a way that 

^ = Z = z{TZ), M = m{Tl). 

In what follows, the apparent placement Ti associated with undeformed Austenite 
will be considered as reference. The elements of TZ will be denoted by X. 

A motion is a suitably regular time-parameterized family of configuration map- 
pings 



X(X,0, z(X,0, m(X,r) 
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such that X = x(X, t), | = z(X, t), and M = m(X, t) denote the place and the 
microstructure at time t of the material point that occupies the place X in 'R. Recall 
also that, more explicitly, ^ stands for 



= Za(X, t), = ZA/(X, t). 

Provided suitable regularity is available, space and time derivatives of motions can 
be evaluated (at least almost everywhere) giving rise to velocities 



V := 



a x(X,o 

dt ’ 



:= 



azA(X,o 

dt 



__ SzmO^^ 0 am(X, t) 






dt 



dt 



and gradients 

_ dx(X,t) . _ dZA(X,t) . _ dZM(X,t) M _ am(X, 0 

ax ’ * ax ’ ’ ax ’ * ax ’ 

( 9 ) 



where are scalars; v, are vectors; v^, F are two-point second-order 

tensors and F^ is a third-order tensor. The standard condition 



detF > 0 



follows from the invertibility of x upon excluding material inversion. Additional 
equality constraints upon the placement may also apply in the form 

q"‘l(F)=0 (k = l,...,Peq), 

( 10 ) 

q"(F)<o (k = i,...,Pi„) 

for suitably smooth constraint functions and CJ,". For example, a requirement of 
isochoric deformation involves Peq — 1 » Pin = 0 with 

Cj‘'(F) = detF- 1. (11) 



9.4. Observer dependence and invariance requirement 

The above introduced kinematic fields, as well as the fields to be introduced in what 
follows, are all susceptible to different representations by different observers. The 
term observer here is understood as synonymous with the i&xm frame used in Capriz 
and Virga (1990). Henceforth, attention will be restricted to observers that agree 
upon the representation of spatial distance between points belonging to the same 
instantaneous apparent placement. This is consistent not only with the Capriz-Virga 
family of rigidly related frames but also with the interpretation in Murdoch (1982). 
For the sake of simplicity, it will be assumed that all observers agree also on the 
representation of time. 

Specifically, let the position of a generic material point p at time t be described 
by the places \{t) and x“*" (t) for two separate observers. Since, by the above definition, 
observers agree upon the simultaneous distance between points, there exists a vector 
c(0 and a proper orthogonal tensor Q(t) such that 

x+(0 = c(0 + Q(f)x(0- 



( 12 ) 
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Similarly, the placement of p in the reference configuration is represented as X”^ and 
X by the two observers, respectively, with 

x+ = Co + QoX (13) 

for vector cq and proper orthogonal tensor Qo (Murdoch, 2000). The different repre- 
sentations of the apparent transplacement gradient are then related by 

F+ = QFQl 

According to Capriz and Virga (1990), it is assumed that the possible observer’s 
representations of the microstructural descriptors are related by a set of transforma- 
tions that form a continuous group that is the homomorphic image of the set of proper 
orthogonal second-order tensors under a suitable mapping that is to be specified on 
the basis of the physical significance of the descriptors. This assumption embodies 
the idea that the microstructural phenomena under consideration are unaltered by 
observer’s translations. 

Taking into account the nature of the phase fractions and of the tensor M, it is 
stipulated that the microstructural descriptors are observer-independent (objective) in 
the sense that the relations among the different representations given by the observers 
obey the standard tensorial transformations (Ogden, 1997) 

(Ia)+=|a, = M+ = QMQJ. (14) 

A basic postulate underlying the whole theory is that every observer agrees on the 
physical behavior of the body, in the sense defined by the following: 

Observer invariance principle: All observers must agree on the 
judgment as to the validity of any statement involving quantities 
susceptible to different representations by different observers. 

Extensive usage of this principle will be made in the following with respect to balance 
equations, constitutive equations and internal constraints. 



9.5. Balance equations and entropy inequality 

Balance equations for mass, linear momentum, energy, and entropy are assumed to 
hold as for standard continua (Green and Naghdi, 1977; Muller, 1985; Silhavy, 1997). 
In addition, according to Capriz (1989), the variations of configuration associated with 
the change in the microstructural descriptors are assumed to give rise to balanced 
microscopic linear momenta. The specific form of the balances is summarized in the 
following: 

Balance principle: Let V be an open subset, with suitably regular 
boundary dV, of the reference apparent placement 7?. of 95 and let T 
be some open time interval of interest. Then there exists an observer 
relative to which the equations 

)S(o)(7>, dVj) = 0 
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hold for every 'P c 72. and almost every time t e T, where 

P(o)(V. aP, 0 := C(o)(P, 0 - <i>(o)(aP, 0 - 5(o)(P, 0 - P(o)(P, 0 (15) 

and(o) spans {p, p, p^, e, r;}, respectively, denoting mass, 

macroscopic linear momentum, microscopic linear momenta asso- 
ciated with with , with M, energy, and entropy.^ 

The various symbols introduced in (15) have the following physical meaning 
relative to the field (o): C(o) denotes the content in P at time t\ <I>(o) denotes the 
rate of exchange through the surface 9P with the external environment at time t\ 
S(o)('P, t) denotes the rate of supply to P by the external environment at time f; and 
P(o)(P» 0 denotes the rate of production to P at time t. Moreover, <E>(o), 5(o), and 
P(o) are assumed positive if the corresponding exchanges tend to increase the content 
of (o). Each of the four quantities involved in the balance equations are assumed 
to be absolutely continuous with respect to the volume and surface measures in the 
reference placement so that 

C(o)(P,0:= f C^o)(Xj)dV, cD(,)(aP,0:= f ^(o)(X, n(X), O^A, 

Jv JdV 

5(o)(P,0:= I 5(o)(X,0^V, P(o)(ap,0:= f P(o)(X,O^V, 

Jv Jv 

where n(X) is the outward unit normal to the reference surface 9P at X, C(o) are 
the field densities, 0(o) are the surface exchange rates, 5(o) are the bulk supply rates, 
and P(o) are the bulk production rates. For the various balances, these densities are 
assumed to be represented as follows. 

Mass is a field that involves neither a surface exchange nor a bulk supply nor a 
bulk production; i.e., 

^^(X, n, 0 := 0, 5^(X, 0 := 0, P^(X, 0 := 0. 

Setting := p, the arbitrariness of P gives a local form of the mass balance as 

P(X, 0 = 0. (16) 

This property permits the remaining fields of linear momenta, energy, and entropy to 
be specified in terms of mass density rather than volume density. 

Concerning macroscopic and microscopic linear momenta the assumptions are 
summarized in the table 



(o) 


<^(o) 


S{o) 


P(o) 


p 


t 


ph 


Pt 


p^A 






pY\^^ , 


P^M 


iHm 


pb^ix 


pO?" 


pM 


jM 


ph^ 





^Clearly, the symbol P(o) is assumed to accommodate different tensorial orders as appropriate to the nature 
of the fields (o). 
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where t is the nominal traction and b is the body force per unit mass in the reference 
placement as is standard in describing balance of macroscopic linear momentum 
with respect to a reference placement. Also are interpreted as generalized 

tractions; as generalized body forces per unit mass; and , 0^^ , 11^ 

as generalized self-forces per unit mass. In addition, suitable expressions must be 
given for the content quantities C(o). Mass density gives the macroinertia associated 
with positional change and the standard expression Cp := p\ applies. Corresponding 
expressions are needed, in general, to describe inertia associated with the change 
of microstructure (Capriz, 1989). As our purpose here is limited to a treatment of 
quasistatic material behavior, all of these fields will be regarded as vanishingly small, 
i.e., Cp = 0, C = C CpM = 0. Summarizing, the above assumptions 
yield 



^p = - 


/ 

JB-p JT 


p(h + {)dV, 

) 




f t^^dA - 
JdV 


f p(b^^ + n^^)dV, 
Jv 




f t^'^dA - 
JdV 


f pib^’^ + n^^)dV, 

JV 


^pM = - 


f t^dA- 
JdV 


( pih^ + H^^)dV. 
IV 



By assuming the continuity of the dependence of the surface exchange rates 
upon the normal, the balanced nature of the interactions ensures that a generalized 
Cauchy theorem holds also for the microstructural fields (Capriz and Virga, 1990). 
This, together with suitable transformations to the reference placement, provides the 
existence of referential macro- and microstresses such that 

t = Sn, • n, . n, = S^n, 

where S is the first Piola-Kirchhoff stress tensor, while the vectors and and the 
third-order tensor are the generalized microstresses with respect to the reference 
placement. 

As regards the energy balance, energy is taken to be a field that does not involve 
bulk production; i.e.. 



Pe=0. 

In the present context, this is regarded as the appropriate statement of the first law 
of thermodynamics. The remaining energy fields C^, 5^, are all decomposed into 

a part that involves the macrorate v, the microrates vm, and a part that 

is independent of these resolved rates. The latter are referred to as the thermal part. 
Energy exchange through a surface and energy supply through the bulk are associated 
with configurational changes involving not only change of place but also change of 
microstructure. Hence, both the surface exchange density rate and the bulk supply 
density rate are given by the sum of the contributions associated with each possible 
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type of configurational change. Accordingly, the energy content density is 
Ce-.= pe + K + 

where the field e is the internal energy per unit mass and K, are 

the kinetic energy fields associated with rates of position change, phase change, and 
Martensite orientation change, respectively. These kinetic energy fields involve the 
notions of macro- and microinertia alluded to above, which play no part in a quasistatic 
description and so are also neglected in what follows; i.e., AT = = 

0. Energy exchange through a surface and energy supply through the bulk are again 
assumed to be given by the sum of contributions associated with each possible type 
of configurational change as well as a thermal part, i.e., 

:= p(b • V -f -f -f- • v^ + r), 

:= t • V -j- -h -f- -h A, 

where r and h are the thermal energy bulk supply and surface exchange, respectively. 
In combination this gives 

Pe = ^ ( pedV - f t • V + 4- hdA 

dt J-p Jsp 

— f p(h b^^v^^ -{■ b ^^ -f b^ • + r)dV 

Jv 

The dependence of h upon normal n is again determined by a Cauchy theorem as 

A = -q • n, 

where q is called the heat flux vector. 

As regards the entropy balance, the surface exchange and the bulk supply 
Srj are assumed to be proportional, respectively, to h and r which are the analo- 
gous thermal quantities in the energy balance.^ The inverse of the common positive 
proportionality factor is called the absolute temperature and thus 




Further, defining r] as the specific entropy content and F as the specific entropy 
production, the content and the production terms are 

Cfj := pr], P rj pP > 

so that finally 

In addition to the above balance equations, the second law of thermodynamics 
gives an entropy inequality requiring, as customary, a nonnegative rate of entropy 



^According to Muller (1967), different assumptions can be made in this respect. 
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production; i.e., 

P, = /* ptdV > 0. (17) 

Jv 

Here and in what follows it is assumed that all fields are sufficiently regular to 
support the analytical tools, explicitly or implictly, invoked. Thus, the arbitrariness 
of V yields the following local statements 

Div(S^v + + (S^)^ o — q) 

+ p(b • V + + r) = pe, 

DivS + p(b + f) = 0 , DivS^ + p(b^ + = 0 , (18) 

Div + p(b^^ + n^^) = 0, Div + p(b^^ + H^^) = 0, 

-Div(|) + p(^ + r) = p»?, pr>0. 

After inspection of the previous relations, it is recognized that 23 fields have 
been introduced. In order to facilitate the next developments, any ordered 23-ple of 
fields defined onTZxT and belonging to a suitably smooth function space Y is said to 
be a possible process for ® and it is denoted as y := {X, C, f , p, S} where the notation 
emphasizes the different role played by the fields within the theory. In particular, 
omitting the dependence upon (X, t), 

configuration fields: X := {x, i?, M}, 

contents: C := {p, e, t]}, 

fluxes: f := {S, q, S^}, 

productions: p := {f, F, fl^^, 0^^, 11^}, 

sources: S := 

The set of possible processes Y collects all the 23-ples of fields whose elements, 
individually, can be realized in some circumstance. However, in general, while the 
realization of each single field is possible, not all combinations y describe real situa- 
tions. The processes which describe physically meaningful circumstances are called 
admissible thermomechanical processes and belong to a set C Y which will be 
singled out from Y after enforcing various requirements: balance equations, constitu- 
tive equations, internal constraints, and observer invariance. To this end, the effect of 
balance equations on processes can be emphasized by writing compactly (y) = 0, 

with appropriate (o), for the balance equations in (18), and (y) > 0 for the entropy 
inequality again in (1 8). 

The principle of observer invariance is now applied to the balance equations. 
This will yield restrictions on the possible processes so as to ensure that all observers 
agree on the validity of the balances. 

Specifically, let the balance equations associated with the observers measuring 
placement and x be written, respectively, as = 0 and jS(o) = 0, where symbols 

are defined in the obvious way in terms of the individual fields as measured by the 
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observers, for example, 

Pe= f p(e - b • V - - ?)dV 

Jv 

— f t- v + +t^ • v^ + WA. 

JdP 

As regards the balance equations for the quasistatic theory, the observer invariance 
principle thus requires that for every pair of observers, 

= 0 ^ )8(o) = 0. (19) 

Transformation rules that augment ( 1 2)-( 1 4) for the additional fields appearing in ( 1 9) 
are derived from the assumption of their objectivity. Material time differentiation of 
the transformation rules for the configuration fields yield the corresponding rules for 
the various velocity terms 

= Q(v + u + S^x), , 

(yM)+ ^ ^ \^)QI, 

where u ;= Q^c and S2 := Q^Q, a skew second-order tensor. The scalar field 
quantities i^, p, /i, r, e, r], F share the same representation 

1 ^+ = ^, p+=p, (i+=h, r+=r, e+=e, r}+ = rj, T+ = f , 

(20) 

while recalling that the theory is quasistatic, the standard and generalized tractions, 
body forces, and self forces obey 

t+ = Qt, (t^-^)+ = , (r^W)-l- = (jM)+ ^ 

b+ = Qb, (fc^'")+ = b^'' , (fe^")+ = , (b'^)+ = Qh^Ql, 

f+ = Qf, (n^^)+ = n^'<, (n^*^)+ = n^", (n'^)+ = Qn'^gJ. 

It is immediate from the above expressions that 

fit = Qi®P’ fit^ = Qfit^QL fit = fin’ 

whereupon the balances associated with p, p, p^, , p^, tj are automatically ob- 

server invariant. As regards observer invariance of the energy balance, it is noted if 
Qo = I that 

t+ • v+ = t • V -h t • u -I- 1 • nx, 
b'*' v'^ = b v-l-b u-t-b - Qx, 

(t^)+ . (vM)-l- = 

(bM)+ . = feM . -M j^M . 
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whereupon elementary manipulations give 

fiT = ^e- f t • U 

JdV 



+ 1 Qx + 



L 



p(h • u + b • + b 



,M 



•aUdA 

• au)dv. 



Terms involving Q can be rewritten as 
t • S2x = 12 • (t (g) x), 

• fiM = ft • 



b • 12x = 12 • (b 0x), 
ftM = ft (b'^M^), 



which in turn gives 



fit - fie 



- 12 • t (g> X + i^M^dA + j p(b 0x4- b^M^)J V j . 



( 21 ) 



The arbitrariness in the choice of the observer measuring placement x"^ implies 
that the expression on the right of the equality in (21) must vanish for arbitrary vectors 
u and arbitrary skew tensors 12. This gives that the first bracketed quantity (a vector) 
must vanish identically and that the second bracketed quantity (a second-order tensor) 
must be symmetric. Further, the arbitrariness in the choice of the observer measuring 
placement x implies that all conclusions with respect to these bracketed quantities 
must hold with the overbar removed. This gives 



skw 




= 0 , 



w 



+ skw f f ph^M^dV = 0. 

JdV JV 



Now, by virtue of t = Sn, the divergence theorem provides 

f DivSH-pbJV = 0 

Jv 

and 

W 

Similarly, by virtue of = S^n, the divergence theorem gives 

2 



= skw / SF^ 4- (Div S 4- pb) 0 xdV. 

Jv 



= skw ( I + (DivS*^ + 

Jv 



( 22 ) 
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hence 

skw [ SF^ + (Div S + pb) ® X + o (F'^)^ + (Div + ph^)M'^dV = 0. 

Jv 

(23) 

Note that the entropy inequality (17) is automatically observer invariant by virtue 
of (20). Under suitable regularity of the various fields involved in (22) and (23), the 
arbitrariness of V leads to the local statements 

Div S + pb = 0, 

(24) 

skw[SF^ + (Div S + pb) (g) X + S (F"^)^ + (Div + pb'^)M^] = 0. 

Thus the relations (24) express the restrictions induced on the processes y by 
the request of observer invariance of the balance equations. As a companion to the 
compact notation = 0 for the balance equations in (18), the observer invari- 

ance requirements (24) will be referred to as B’“(y) = 0 and B^(y) = 0. It is now 
possible to summarize the effect of balance equations on the possible processes in the 
following. 

Definition 1. A process y 6 Y is said to be consistent with balance equations B 
or, equivalently, y G Y® c Y, if, after substitution of y into (18), the resulting 
relations 

l(°)(y)=0, B*'(y)>0 

hold true for every (o) spanning the fields {p, p^*^, p^, e, and almost every 

(X, t) e B xT. If, in addition, 

l"(y) = 0 , l“(y) = 0 , 

then y is said to be objectively consistent with balance equations B, or y € Y^ c Y®. 



9.6. Constitutive assumptions 

Not all processes y € Y® are relevant for modeling the behavior of shape memory 
materials, since the above definition applies equally to all materials whose microstruc- 
ture can be described by two scalars and an objective two-point second-order tensor. 
In order to characterize specific materials and environments, constitutive equations 
are added via relations between the configuration fields X and 

r := {f, r], s, q, , S^, f , T, , 0^*^ , n'^}, 

^ f P (25) 

S = {b, r, 

Here and in what follows, the definition of C is updated so as to eliminate mass density 
p, which is completely determined by (16), and to replace internal energy in favor of 
the Helmholtz free energy 



yj; e — 



(26) 
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According to Noll (1973), constitutive equations for r and S are called, respectively, 
internal if they characterize the constitution of the material and external if they 
characterize the interactions of the body with the environment. Both of them are 
specified according to the following: 

Constitutive principle: Let the symbol a span the fields in {r, S}. 

Then, relative to a given observer, there exist suitably regular con- 
stitutive functions^ such that 

oc(Xj) = CM(X)]; (27) 

i.e., the value of the field a at (X, t) equals the value of the constitu- 
tive function C« at Xa(x), where Xa{x) denotes the result of some 
operation that extracts the arguments of the C^s from the fields X. 

In the most general case, the X^s might be identified with functions that extract 
from X the past history up to the time t so that the constitutive equations would be 
affected by the past history of all the fields X all over the whole body. However, 
usually, this level of generality is not manageable nor actually needed and therefore 
the XaS are restricted by further assumptions that reflect the specific degree of space 
and time nonlocality of the theory. When specialized to r, the above principle requires 
the assignment of 5 constitutive functions 

that characterize an environment, while relative to S, it requires the assignment of 12 
constitutive functions, also called response functions, 

R := {f, S, q, f , f, , fl?" , n*^} (28) 

that characterize a material. Two different observers will, in general, give differ- 
ent representations of the constitutive functions, say Cj and (Murdoch, 1982). 
However, such representations must obey the observer invariance principle in the 
sense that 

a+ = C+[A’„(X+)] ^ a = C„[A;()0], (29) 

where X"^, and X, a are the different representations of the relevant fields as given 
by the two observers (a 6 {r, s}). 

Definition 2. A process y g Y is said to be consistent with the constitutive func- 
tions C or, equivalently, y G Yc C Y if after substitution of y in (27) the resulting 
relations 



a(X,0 = Q[A'c,(X)] 

hold true for every ot g {r, s} and almost every (X, t) e B xT. If, in addition, y 
satisfies (29) for every pair of observers, then y is said to be objectively consistent 
with the constitutive functions C, or y G YcCYc. 

^The term function is used throughout without attempting to pursue the distinctions implied by other 
terms like Junction, functional, operator. The meaning can be inferred from the nature of the domain and 
codomain. 
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If X(C is the set, assumed to be nonempty, of fields X such that all constitutive 
functions C are simultaneously defined, then any process in Yc has the form 

y = {X, R[A'(x)], S[A'(x)]} for some x e Xc 

and it is said that y is induced by the configuration fields X through C. If, furthermore, 
the external constitutive functions S are such that the linear momenta and entropy 
balances are fulfilled for every X and R[A'(X)], then S are said to sustain the process y. 

The particular type of constitutive assignment to be considered here motivates 
the notation 

e := {F, I?}, m := {?A, M, F^^ F"^}, 

which distinguishes between the relevant macroscopic fields, collected in e, from the 
microstructural fields and their gradients, collected in m. Moreover, the referential 
temperature gradient g(X, t) shall denote the derivative of the temperature field with 
respect to X; i.e., 

3i^(X,0 

^ “ ax ■ 

A further notational tool is required to distinguish the values of the fields at (X, f). 

e := {e|(x,/)}. H! := {m|(x,o}- S_ ■= 5l(x,o> 

from the difference history up to the time t at X of the fields m (Coleman, 1964), 
which is indicated as 

md(5) := {(^A, fM, M, F^^ , F?«, F'^)U(X, 5)}, 
where, for example, 

M^i(X, s) := M(X, t — s) — M(X, t) for > 0. 

Summarizing, henceforth the underscore will mean field-evaluation at (X, t) while 
juxtaposition of ^( 5 ') will mean difference history up to the time t at X. 

Taking advantage of this notation, it is now easy to indicate which particular 
XciS are adopted here. Sources are specified by a direct field assignment, equivalent 
to 15 scalar fields, and degenerate into 

Xa := {X, t) for a G S = {b, r, ,\^]. 

The response function for the entropy production F is assumed to depend on the actual 
value of e, g, m, as well as on the past history of m; i.e., 

Xt := {e,g,rn,m(5);X}. 

All other response functions, equivalent to 61 scalar functions, are assumed to depend 
only on the present values of e,g, m; i.e., 

A’a :={e,g,m;X} fora = {i^, /j,S,q,s^\s^",S'^,f, n^'', n*^}. (30) 

The dependence on the past history of the microstructural descriptors is introduced 
in order to model the hysteresis typically observed in SMM, while the fact that this 
dependence here applies only to F has been introduced for the sake of simplicity. 
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In view of the central role that the entropy production function will play in 
the theory, additional structure is now attributed to T to reflect some basic physical 
assumptions. It is assumed that it is possible to resolve additively two uncoupled 
contributions to the entropy production. One contribution F hc is due to the heat con- 
duction. The other contribution Fpt is due to the dissipative mechanisms associated 
with the phase transformations and Martensite reorientation; i.e., 

r(x,o = r//c(x,o + r/>r(x,o. 

Moreover, the dependence on the past history of the microstructural fields applies 
only to the latter contribution so that 

r//c(X, t) = r//c(e,g, m; X), 

TprCX, 0 = fpr(e, m, m(^); X). 

Since F enters the theory, not directly, but only through its time derivative at time t, 
the constitutive functions F hc and Fpp are now further restricted by two hypotheses 
that characterize the present values of their time derivative. First, it is assumed that 
the function Fhc is such that 

f //c(x, 0 = is • g- ni; x)g, (31) 

where K is a positive-definite, symmetric, second-order, tensor-valued constitutive 
function (the heat conduction tensor). 

Second, it is assumed that the function Ppp is rate independent in the sense 
that it is unaltered by time rescalings. More precisely (Owen and Williams, 1968), 
for any increasing continuous function ip of the elapsed time s that maps [0, oo] into 
itself, the function Fpp satisfies 

Fpp[. . . , m^((;p(5'))] = Fpp[. ..,m^(.y)]. (32) 

The hypothesis (32) stems from experimental observations that, to within effects in- 
duced by the thermomechanical coupling, the quasistatic progress of SMM phase 
transformation is practically insensitive to rate of mechanical load application and 
rate of induced temperature change (Otsuka and Shimizu, 1986).^ This observation 
is often regarded as due to the fact that atomic diffusion is not required for Austenite- 
Martensite phase transformation or for Martensite reorientation (Fischer et al., 1994). 
Hypothesis (32) imparts special significance to instants of time at which m( 5 ) un- 
dergoes reversal. For scalar- valued fields this notion is straightforward and can be 
described with the aid of a monotonicity partition of the past into a class of intervals 
during which the field either increases, remains constant, or decreases. When a suit- 
able order relation and a notion of piecewise-monotonicity are available for vector- 
and tensor-valued fields, the set of switching instants is an index set 

and oi spans the fields in m) can be defined as those instants that partition the past 

^Time scales introduced by thermomechanical coupling (e.g., mechanical loading in a convective envi- 
ronment) will however confer rate dependence in this setting and so are consistent with this notion of rate 
independence (Ivshin and Pence, 1994b). 
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into subintervals where the fields preserve their monotonicity. The set of the values 
attained by the microstructural fields at the switching instants is denoted 

m, := {a I, a for every or e m and i e J^]. 

Under suitable regularity assumptions, a time derivative at t of the function T pt can 
be defined according to Owen and Williams (1969) and it is denoted as Fpr, after 
omission of the caret for the sake of simplicity. The following representation for f 
can then be obtained. 

Theorem 1. Let rh := , F^ } and § := {F, ^}. Suppose that 

r PT is rate independent in the above sense and is piecewise monotone. Then 
there exist two continuous functions, and A®, of(e, rn, £Q,) such that 

Fpr[e, m. ni(^)] = A'^(e,m,m^) • m + A®(e, m, m,) -e, (33) 

where the notation • denotes the appropriate scalar products. 

The theorem can be proved by a combination of arguments from Owen and 
Williams (1969) and Brokate and Sprekels (1995) and so provides theoretical justifi- 
cation for the introduction of the discrete memory parameters, here denoted by m, , 
which in the SMM literature have often been introduced in order to describe internal 
subloops (see, e.g., Ivshin and Pence, 1994b; Rajagopal and Srinivasa, 1999). More 
explicitly and for later use, equation (33) can also be written as 

tpT = + A^^Im + A^-y^ + A^^ • 

where it is remarked that all the rates are evaluated at (X, t). 



9.7. Internal constraints 



Various internal constraints have already been introduced in order to delimit the phys- 
ically admissible values of the microstructural fields and of the apparent transplace- 
ment gradient. For convenience, they are recalled below 

«^r(^) = 0, ^•‘•(M) = 0, q"‘’(F)=0, iij,k) = l (Meq, iVeq, Feq), 

<«)<0, gf(M)<0, C*"(F)<0, ii,j,k) = l (Min, Mn, Fin), 

(35) 

which, compactly, can also be written as 

D®^(X) = 0, D‘"(X)<0. (36) 

The observer invariance principle applied to (35) requires that all the constraint func- 
tions satisfy, for all proper orthogonal tensors Q(t) and Qo, 
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[Q(0MQJ], C'*’(F) = q"'^[Q(r)FQj], 

0> = 1) • • • ) (^eqi ^eq)> 

(37) 

gf(M) = gf[Q(t)MQh C*"(F) = Cl“[Q(f)FQj], 

U,k) = h...,(Nin,Pin). 

It is noted that examples (6), (11) satisfy (37). 

The effect of the constraints on the possible processes is summarized in the 
following. 



Definition 3. A process y G Y is said to be consistent with the constraints D or, 
equivalently, y 6 Yjo C Y if after substitution of y into (36) the resulting relations 
hold true for almost every (X, t) e B xT. If, in addition, the constraint functions 
satisfy (37), y issaidtob^objectively consistentwiththe constraints, ory e Yp C Yp. 

For the next developments it is necessary to obtain explicit expressions of the 
effect of the constraints on the present values of the rates of the constrained quantities. 
This is provided by the following. 



Lemma 1. Let y be a process consistent with the constraints (35) for suitably regular 
constraint functions. Then for almost every (X, t) e B xT, 



for any 



such that 



So • F = 0, 

nj* • + sj* • F^ = 0, 

+ si'' ■ F^'^ + = 0, 



n 



Ia 



?A 



n 






:= + n? 

**eq ~ "in 

:= nf" + nf 



Hm 
in ’ 



*0 — ^eq ^ ^in ’ 

Jm 






So 



— ®eq ^ »in ’ 

eM 1 oM 

— ^eq ' ^in ’ 

^eq 



Mec 



= E 



i=l L 



ar'^ + 



A/eq r 



n|? := E 



/=! 



' 9?m 



9^10®“^ t ea 

^F^A . 

dV 






7=1 L 



l2 eq 






eq ^7 , ^7 ^rFM'eq.r 



(38) 



(39) 



(40) 
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and 




with arbitrary scalar fields aj^, arbitrary vector fields Moreover 

the quantities , IlJ^, S^, Sin ha^^ the same structure as (40) and 

(41) but involve arbitrary nonpositive scalar fields a^ < 0, af^ < 0, < 0 and 

arbitrary componentwise nonpositive vectors fields < 0, < 0 whenever the 

associated inequality constraint in (37) is satisfied with strict equality. 

Proof. The contribution of the equality constraints to (3 8)-(4 1 ) follow after differenti- 
ation of the constraints, combination of the derivatives by arbitrary fields a ^^ , , p^j ^ , 

c®.^, and subsequent application of the chain rule. To obtain the contribution of the 
inequality constraints, consider that whenever u;J^[^(X, 01 < 0, ^y^[M(X, 01 < 
Cj^”[F(X, 0] < 0, then each constraint can either decrease, increase, or remain con- 
stant. However, whenever one or more constraints become active, i.e., satisfied as 
equalities, then the right time derivatives of the constraints are restricted to be non- 
positive. The same applies to each single component of the spatial gradient of the 
active constraints. Accordingly, standard complementarity relations follow. 




where the time derivatives are understood from the right and the nonpositiveness of 
the vectors and is componentwise. The relevant part of (38)-(41) then follows 
by application of the chain rule. □ 



This lemma provides a characterization of the effect of the constraints on the 
present values of the rates of the constrained quantities, i.e., v^, 

pM, p arbitrary fields aj^, pl^, a^, pf, cj, cj" are referred 
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to as the reactions of the constraints and are assumed to exist for every choice of 
the configurational fields X. A standard physical interpretation for these reactions 
is in terms of productions and stresses that enforce the constraints. The reaction 
associated with an inequality constraint vanishes when the inequality is strict. Under 
these circumstances the inequality constraint is said to be inactive. In contrast, equality 
constraints are always active. 

For the case (11), the definitions (41) give 

so that is the hydrostatic pressure. For the case (2), the definitions (40) and 
(41) give reactive productions and reactive microstresses for phase transformation in 
the form 



— ^1 ’ ®eq ®eq ^'1 * 

For the case (6), the definitions (40) and (41) give reactive productions and reactive 
microstresses for Martensite reorientation in the form 

= a^M-^ + [(M~^ ® M~^) - (M-^ (8) o o c^f. 



9.8. Characterization of admissible processes 

Combining Definitions 1, 2, and 3, it is finally possible to identify the processes 
actually relevant for the theory. 

Definition 4. A process y g Y is said to be an admissible thermomechanical process 
with respect to balance equations B, constitutive functions C, and internal constraints 
Dif 



y eY^r\Y^nY^:=YACY. (42) 

The aim of the next developments is to find a procedure that can ultimately lead to 
boundary value problems whose solutions describe admissible processes. To this end, 
it is assumed that, for every choice of the configuration fields X, there exist constitutive 
functions Cx such that the process induced by X through Cx is consistent with balance 
equations (Dunn and Serrin, 1985). It follows therefore that the characterization of 
Ya can be reduced to the search for suitable constitutive functions. 

The next lemma provides conditions on the constitutive functions that ensure the 
simultaneous consistency of a process with both balance and constitutive equations. 
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Lemma 2. Let y 6 Yc D Yb be a process consistent with the constitutive functions 
C = {R, S} and with the constraints D. Then y € Y® and only if the external 
constitutive functions § sustain y and the response functions R satisfy 



?= 


1 

1 






s = 


a aF , c 




f = 


0, 




= 0, 


q = 


-pKg, 






= 


drjf 






= 


dx/r 

^3F5'' 


+ p^A'^'’ - 


psi\ 


n?M = 


d\lf 

Hm 


- 


_ 


'°3F«" 


+ p&A^^ - 


-p4^ 


fi^ = 


drjf 

“3M 




+ K^ 




^3FM 


+ p&A^^ - 


ps^. 



for some choice of the arbitrary fields in (40), (41). 

Proof To prove necessity, assume that y = {x, r, s} is consistent with the balance 
equations. Then the balance equations for the linear momenta and entropy are satis- 
fied; i.e.. 



BP(y) = 0, = l^**'(y) =B"(y) = 0, BP'^(y) = 0. (44) 



Now since (44) are affine and decoupled in the sources, one can solve these for S and 
substitute the result into B^(y) = 0 which, after using (26) and the assignments (28), 
gives 



- pf • V - 

+ S • F + 

- pif - = 0 . 



with yjr = d\j//dt and t = dT jdt. Since y € Yp, the previous lemma applies and 
the three equations in (38) multiplied by the respective factors — 1, p, p can be added 
to the left-hand side of (45) without altering its value. By using the chain rule for 
and the representations of f in (31) and (34) it follows that (45) becomes 
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S-p-5f-/.M -Eb-3JT 



- P (?+ ^ + ^-pf-v- ■ g - ^ (q + pKg) • 

S £ S“‘ + ■ '‘) ■’*" 



g 






+ 

+ 



. = i 

7 = 1 

^ 9u;/ _ \ . t 






(46) 



9 ^ , ^9wi 

1=1 



+ - o— 



\ 



For the sake of readability, the effects of the equality and inequality constraints in 
(39) have been condensed into the same symbol. The actual meaning should be clear 
from (39)-(41). The left-hand side of (46) depends on (X, t) via substitution of the 
configuration fields X into the response functions. Since y e Yc, it is induced by 
some configuration field X. Since y is arbitrary in Yc, due to the assumption following 
Definition 3, (46) holds for every choice of X. Then S sustains y. Taking into account 
the arbitrariness of the fields and cjP, df, the rates 

yM^ F^A^, F^, F can take arbitrary values at (X, t) independently from 
X. Hence the parenthesized terms, which in general can depend on X and on the past 
history of the microstructural fields but not on the present values of the above rates, 
vanish, hence establishing (43). Sufficiency follows by direct substitution. □ 



The principle benefit of this lemma is to reduce the constitutive assignment for 
the response functions R to the specification of two functions, namely ^ and F. 

Following Murdoch (1982), objectivity conditions for xjr and F can be enforced 
by the requirement that 

f[. . . , F, M, F'^, M(^), F^(^)] = fg 



(47) 
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with 



fQ-.= ^(...,QF,QM,QiF'^) 

Fq := f[. . . , QF, QM, Q i F^, Q(^)M(5), Q(s) o fI^C^)] 



for every proper orthogonal tensor Q and every proper orthogonal tensor history Q(^) 
such that Q(0) = Q. It follows by direct calculation on (43) using (47), (48), (37) 
that the remaining response functions in R are then also objective. 



Definition 5. A set of constitutive functions C = {R, S} is said to be an admissible 
constitutive assignment if, for every y €¥([;;: (i) R obey (43) and (47), (ii) (y) > 0, 
(iii) S sustains y. 



We are now in position to characterize admissible processes. 

Theorem 2. Let C be an admissible constitutive assignment, then a process y e Y 
is an admissible thermomechanical process for B over T if and only if 

lP(y) = 0, BP^'"(y)=0, BP^"(y) = 0, BP'^(y) = 0, B"(y) = 0 (49) 



for almost every (X, t) e B xT. 



Proof Necessity is immediate. As regards sufficiency, there is only the need to prove 
consistency with the energy balance and observer invariance. The observer invariance 
of constitutive equations is ensured by hypothesis via Definition 5. The condition 
B^(y) = 0 follows from (43) in conjunction with the subsequent discussion. Due to 
f = 0 and f = f the condition B^(y) = 0 gives B“(y) = 0. Therefore, it remains 
only to show that B^(y) = 0. To this end observe from B*^(y) = 0, B^^(y) = 0 that 
B^(y) = 0 is equivalent to 



skw[SF^ + ^ S (F”^)^ - = 0. 



Substituting from the expressions for S, and II gives, after rearrangement. 
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+ p& skw o 



racf*. 



+E^rskw|^-^ 



N, 



eq 



eq 

p skw 



eq 



— p skw 

7=1 

An 

+ E 

k=l 

Nin 

— p y^ skw 

y=i 



3M 

"aci" 



7=1 

:-)J(fM) 



3M 

2„eq 






3F 



^FT 



A^in 






2=1 



9M2 

a«“ 



(F^ic^)' 



IM^ 



9M 



M' 



!£ 

9M 






M' 



= 0. 

(50) 



It can now be shown that each of the bracketed second-order tensors in (50) is indi- 
vidually symmetric. To show this for the first one consider the observer invariance 
on xjf for a parametrized smooth family of proper orthogonal tensors Q(X) so that 



f [. . . , Q(A)F, Q(A)M, Q(A) i F^] = f (. . . , F, M, fI^). (51) 



For an arbitrary skew tensor W pick Q(X) such that 

9Q(A) 



Q(0) = I, 



dX 



U=o = W. 



Differentiating (51) with respect to X and setting X = 0 yields 

-1- i (F^f -1- . W = 0 

L3F ^ ^9M J 



(52) 



Since (52) holds for arbitrary skew W it follows that the bracketed tensor is symmetric. 
Similar arguments using the objectivity of Tpt from (47), (48), and the observer 
invariance of the constraints (37) give the symmetry of the other bracketed tensors 
in (50). □ 



9.9. Statement of specific problems 

After the above discussion of the general aspects of the theory, specific problems 
can now be stated for a given admissible constitutive assignment. Here “to state a 
specific problem'' is understood in the sense “to give a specific choice of the sources 
and determine the corresponding admissible thermomechanical process." 

The input of a problem thus involves a set of source fields S that character- 
izes the specific environment under consideration. Substitution from the constitutive 




9. A Multifield Theory for Shape Memory Materials 



229 



assignments and gradient definitions (9) into the balance equations (49) then pro- 
vides the equivalent of 15 scalar PDE’s in the unknown fields X which, given suitable 
boundary conditions, describe the quasistatic behavior of shape memory materials. 
If one or more solutions of the system are found,^ then the previous theorem ensures 
that they induce an admissible process via back-substitution within the constitutive 
assignments. Since the energy balance has already been used to find restrictions (43), 
it does not contribute to the final system. If constraints are present, the system is 
enlarged to include (35) and the unknown fields are augmented by the associated 
reactions. 

The equations arising from macroscopic linear momentum and entropy balance 
are standard in structure whereas those arising from the balances of microstructural 
linear momenta are peculiar for the description of the shape memory materials be- 
havior as they govern the evolution of the microstructure. Specifically, the two scalar 
equations 

('’Si 

' , ' ' ' 

+ + Div psl'^) = , 

. - - . ^ ^ (53) 

+ (~pul^ + Divps^*') = pp>^ 

provide Martensite transformation balances that govern the conversion between 
Austenite and Martensite, whereas the tensorial equation 




+ (-pnS* + DivpsS*) = pb^ 



can be regarded sls a Martensite reorientation balance and governs the transformation 
among different types of Martensite. In the above balances four groups of terms are 
identified: one, depending on the free energy gives the internal driving force of 
the phase transformations; a second one, arising from the dissipation function Tpj 
via the various As, provides the internal resistance to the phase transformations; a 
third one associated with the reactions of the internal constraints; and, finally, a source 
term that describes the external actions directly influencing the phase transformations. 
Simplified versions of the Martensite transformation balances were derived within an 
internal variable framework in Bernardini and Pence (2002b). 



^We do not discuss existence and uniqueness issues. 
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The case of (2) permits elimination of the phase fraction reactions between the 
two phase transformation balances (53) when the inequality constraints are inactive. 
This gives 






(54) 



As described in the introduction, other models for shape memory materials are based 
on phase transformation kinetics that are either posed directly or obtained by an 
alternative development. In the context of the present development as embodied in 
(54), kinetic equations follow as time derivatives of the transformation balances and 
therefore, besides the fundamentally different derivation, exhibit a richer structure 
due to the presence of the divergence term and the source term. 

In the statement of specific problems the issue arises as to the explicit depen- 
dence of the constitutive functions and constraints upon F, M, so as to provide 
necessary and sufficient conditions for the observer invariance of the constraints and 
constitutive equations. With respect to (37) this issue is addressed by the following 
well-known result (see, e.g., Ogden, 1997) which follows from the polar decomposi- 
tion theorem of nonsingular linear transformations. 



Lemma 3. Let W be a vector space. If a function z : Lin Ve -> W satisfies z(A) = 
Z (QA) for all proper orthogonal Q and nonsingular A , then there exists a companion 
function zi such that 

z(A) = z 2 (A^A). (55) 

It follows from this lemma that the constraint functions present in (37) can thus 



be expressed in the form 


|^‘>(C'^) = 0, C;^‘>(C) = 0, (;,A:) = 1,.. 


• » (^eq> ^eq)) 


(56) 


< 0. C|"(C) < 0, U, k) = l,.. 


• » (Mn? ^in)* 


with 


C := F^F, := M^M. 




(57) 



Under (12), (13) with Qo = I the appropriate observer representation transformation 
rules are 



C+ = C, (C^)+ = c^, (58) 

whereupon it follows that no additional conditions are required on the functions 

ensure observer invariance. The reactive stress So given in (39) can 
thus be expressed in terms of derivatives of the functions with respect to 

C. Similarly, the reactive stress and reactive production 11^ given in (39)-(40) 
can be expressed in terms of derivatives of the functions g®*’, fy* with respect to C*^. 
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The functions Cf can always be taken to have symmetric dependence 

on their symmetric tensor argument of either C or C^, which in turn simplifies the 
resulting derivatives so as to give 

Peq 

5eq “ 



Sea=2^pf‘’F- '■ 
1=1 



ac 



N, 



eq 






N, 



eq 



n« = ™ 

;=i 












Analagous expressions hold for Sin, Il|^. 

Turning to the explicit dependence of the constitutive functions upon F, M, F^ 
the appropriate extension of (55) is given by the following. 



Lemma 4. Let W be a vector space. If a function z : Lin Ve x Lin Ve x Lin(VE, 

Lin(VE)) -> W satisfies z(A, B, G) = z(QA, QB, Q o G)for all proper orthogonal 
second-order Q, nonsingular second-order A, second-order B, and third-order G, 
then there exists a companion function zi such that 

z(A,B,G) = Z2(A^A,A'^B,A’*'iG). (59) 



Proof The definition 

Z1 (A, D, H) := Z(A, A"^D, A~^ i H) 
gives z(A, B, G) = zi(A, A^B, PJ o G) and 

Zi(QA, D, H) = z[QA, Q(A"^D), Q i (A“^ o H)] = zi(A, D, H) 

for all proper orthogonal second-order Q, whereupon the previous lemma gives the 
existence of zi such that z\(A, D, H) = zi(A^A, D, H), whence 

Z(A, B, G) = zi(A, A^B, A^ i G) = Z 2 (A^A, A^B, A^ i G). □ 

As a consequence of (59) it follows from (47) and (48) that there exist suitably 
smooth functions \f,K,rpT such that 

;^(e,m;X) = ?(e°^m°^x), 

K(e, g, m; X) = K(e°\ g, m'”’: X), 
fpT(e, m, m(j); X) = ^p^(e°^ m°^ X), 

where the modified set of arguments read as 
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with 

rM pT-M, h''’^ := o (60) 

Since (43) gives expressions involving derivatives of yj/ with respect to F, M, 
F^, the following connections to yj/ are noted: 




where, in (6 1 ), is required, without loss of generality, to have symmetric dependence 
on C. Similarly, from (34) it follows that 






= 2F 



9T pT 

ac 



+ M 



ar pT 

aHM 



T 



— \T 

ar PT \ 

anFM / 



(64) 



aM 



p^rpT- 

aflM’ 



afm 



= F( 



or PT 

aiP^’ 



(65) 

( 66 ) 



where the dependence of on C is also symmetric and the derivatives are un- 
derstood, if needed, in a generalized sense. The response function specification in 
(43) thus holds under the replacement [yj/, K} ^ {V^, K}, where (61)-(66) aid in the 
expression of S, El^, and S^. Under (12), (13) with Qo = I the appropriate observer 
invariance transformation rules for (60) are 

(hM)+^rM = 

whereupon it follows with the aid of (58) that no additional conditions are required on 
the functions ylr,r pT,Kto ensure observer invariance. This in turn confers observer 
invariance on the full set of response functions R in (28). 



9.10. An infinitesimal strain theory 

9.10.1. General development 

On the basis of the previous developments, an infinitesimal strain model can be 
constructed for the case where 

(F - I) . (F - I) « 1 , (M - I) . (M ~ I) « 1 

such that (F — I) • (F — I) and (M — I) • (M — I) are allowed be of similar magnitude. 
Internal constraints are assumed to be given by (2), (3) and (6), (7). To this end, it 
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is useful to introduce the displacement gradient 3^ := F — I and the corresponding 
quantity := M — I. Then the following linearizations are obtained: 



c = I + 5^ + = l + + 

= I + 3^ + (^F^T- ^ jjpM ^ pM 

Equations (67) thus motivate the definitions 



.M 



sM , /sM\T 



(67) 



1 



e := ^[5' + («Vl, a:=^[i^ + (S’^y], /t := ^[5"* + 

where e is the conventional infinitesimal strain tensor and a can be regarded as the 
corresponding Martensite transformation strain. Accordingly, in this infinitesimal the- 
ory, the fields {e, a, fi, F^} replace {C, C^, H^, H^^}. Direct connections between 
these fields are given by 



1 . 



M I /sM\Ti 



1 . 



sM I /sF\T 1 



e + oc = ii + ii‘, F^ + (F'^r = 2^. 

OA 

The constraint functions (6) and (7) expressed in terms of a give, to leading order, 

g^(a) = ce-l, gf(a) = 4a-cc-(g*f, 

where g* is necessarily small. Thus the reactive stresses and productions associated 
with the equality constraints are now 

+ [(I ® I) - (I ® Ifl o (pM i = I ® c^‘). 

The constitutive assignments in the infinitesimal treatment are inherited from the 
finite strain treatment simply by replacing {C, with {l-h2e,l^2fi, F^} in 

the response functions t/r, K after retaining only leading-order terms in {e, /t}. 
This entails the existence of suitably smooth functions K, Ppp such that 

t/r(X,0 = ^^(e”f,m‘"^X), 

K(X,^) = K(e‘"^g,m“;X), 

TprCX, 0 = ^pr(e‘"^ X), 

where the modified set of arguments read as 

:= [e, := {^A, Hm, fi, F«^ , F^«, F’^}. 

The application of (33) to Fpj then leads to functions A analogous with those already 
introduced in (34). The remaining reduced constitutive equation fields follow from 



doc 
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the previous development after substitution of the modified response functions 



s 




1 t^m 2 / 


9FmJ 






V 




q = 


-pKg, 








n?A 


dxfr 




gfx 


dx/f 




-ps^^ 




d^lr 

Hm 






dyfr 






„M 


1 a^ 

2 diL 


- + ng*. 


SM 


- ^gpM 


+ pi?A* - 





( 68 ) 



( 69 ) 



with 



82^ ’ 



~F afpr , 

A = h 


MJ/ 


'9rp7-\^ 




a« 


2 8/1 \ 


9FM j ’ 








A^ = 


1 ar pj 


9Ja 






2 dfi 


~F^A aPpr 


aPpr 




ar PT 


^ ~ ’ 


~ 8F?W ’ 


A = 


9FM ’ 


_ 9rp7- 









( 70 ) 



where the dependence of both and Vpt upon e is taken to be given by a symmetric 
representation and, if needed, the derivatives have to be understood in a generalized 
sense. 

A useful special case occurs if the dependence of ^ upon fi is mediated solely 
through sym[/i-]. Then, since sym[^] = i(a + e), it follows that 

^ (e‘"^ , ; X) = ^ ; X) 

for a suitably smooth function where 



Taking a symmetric representation for the dependence of ^ on e and a symmetric 
representation for the dependence of ^ on both e and a gives the connections 
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whereupon (68), (69) gives that S and 11^ take the form 






df 






da 

Similar conclusions apply to A^, in the event that the dependence of Vpr upon 
fi is solely through sym[/Lt]. 



9.10.2. Example 

A simple example is described in the following for the purpose of showing certain 
features regarding the structure of the resulting model. To this end, the following 
specialization of the basic response functions are considered: 

if = 

fpr = rp7-[|yl, Im. (t/l. (71) 

K = /cl, /c > 0, 

where, as prescribed by Theorem 1, ^M)i are the values attained by the phase 
fractions at the switching instants which allow for the modeling of internal pseudoe- 
lastic subloops. The above constitutive dependence for Fpr has been introduced for 
the sake of simplicity and it provides for hysteresis in the transformations between 
Austenite and Martensite but not in the Martensite reorientation. The latter hystere- 
sis is associated with Martensite reorientation that lags S and can be accommodated 
directly in this framework by considering V pj with dependence also on a. 

Taking (e, a, = (0, a, i?o, 1,0) for any a obeying g'"(a) < 0 as 

reference state for Austenite and («, a, |m) = (0, 0, 0, 1) as reference 

state for Martensite, the following expression is considered for the free energy, 

w 1 2 2 

V^(«, a, ^m) = -(« - oCo{e — 

r ^ 

-hch^ — 1^0 — — 

L ^^0 

+ ^m[-A^o(i> - ^o) + («)] + ?m), 

where 

■= := ia o B o a, (73) 

respectively, provide expressions for an Austenite-Martensite and a Martensite- 
Martensite interaction energy. The scalar Q and the fourth-order tensor B characterize 
the details of the underlying model for the phase arrangement and, for the sake of 
simplicity, are assumed to be constants. They can be either derived experimentally 
or computed by micromechanical analyses (Bernardini, 2001). Also, C is the elas- 
tic modulus fourth-order tensor, c the specific heat, rjo the entropy at an Austenite 



- - i^o) + lAo 



(72) 
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reference state, rjo Ar]o is the entropy at the Martensite reference state, and is 
the free energy at both an Austenite and the Martensite reference state. Again for the 
sake of simplicity the difference in the material parameters between Austenite and 
Martensite have been neglected but it can be taken into account by following, e.g., 
Bemardini (2001). Also, piezocaloric terms associated with, say, thermal expansion 
are here omitted. The tensors C and B are assumed to obey the symmetries 

c = B = (74) 



and are also taken to be invertible with respective inverse tensors S and A. By ob- 
serving that 

^(0, a, 1,0) - ^(0, 0, 0, 1) = Arjo(i^ - i^o), 

it follows if Ar/o ^ 0 that i^o is a unique reference temperature in the sense that 
^ ^ gives 

^(0, a, 1,0) ^^(0,0, 0, 1). 

As a consequence of the above assumptions, it now follows from (68), (69) that 
2 

S = pCo(e- ^A/a), t] = c\n — + rjo-^ 

I/O (75) 

s^^=— pci, = — pci, S^ = — pl(g)ci, 



and 



= -Q^a + A»jo(i7 - #o) + a o C o (e - 



+ ai — -aoBoa — 
2 



= ?m[C o (e - ^A/a) - B o a] + ail 

+ [I®I-(I(8)I)^'lS(F^oci)^, 



where the functions and A^*^ characterize the specific expression of F pt and 
can depend on the present value of the state as well as of the values attained by the 
phase fractions at the switching instants (Bemardini and Pence, 2002b). For constant 
p consider the case of vanishing source fields b, in conjunction with 

inactive inequality constraints. Then the governing equations reduce to the equality 
constraints 



= l» 

a 1 = 0, 



(76) 

(77) 



the stress equations of equilibrium 

Div[C o (e - Ia/o!)] = 0, 



(78) 
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the heat equation 

/C !? + r + - i^Ano^M = ch, (79) 

the reorientation balance 

- Div(I <gi Cl) + ^m[C o (e - ^Mo) -Boa] 

~ 2 TV/f 1 'T \o0} 

+ ail + [I (8) I - (I (g) I)'' "] o (F"^ o ci)^ = 0, 



and the phase transformation balance 

- Ha) + Ar;o(t^ - ?^o) 

2 2 1 2 2 ( 81 ) 
+ a o C o (e — Hm^) — -oc o B o a = ~ A^^). 

If the fields (e, t>) are now taken as spatially homogeneous (independent of X) then 
these equations support a spatially homogenous solution for with 

= 0, and arbitrary spatially homogeneous ci. Specifically, the stress equations of 
equilibrium (78) are satisfied identically and the heat equation (79) gives the requisite 
process heat supply r. The time evolution of the microstructural variables a, 
and the remaining constraint variable a\ are then determined by the constraints (76), 
(77) in conjunction with (80), (81). If, in this context, stress and temperature are re- 
garded as prescribed, then (S(0, ^{t)) are both intensive in the usual thermodynamic 
sense. This provides a standard Gibbs framework for considering the response of the 
remaining variables a, Substituting from (75) into (80), (81) gives 

?m(S - /oB o a) + pa\l = 0, (82) 

P^i^M - ^a) + pAr]o(d^ - t?o) 

+ a o S — ^a o B o a = — A^^). (83) 

Equations (82) and (77) can be solved for a and a\ giving 

a = iA(S), (84) 

P 

with 



A(S) := A o S - 



loAoS 

iSaSi 



All, 



(85) 



where A, as the inverse to B, inherits symmetries from (74). Note that if B is isotropic, 
then (84) renders a as a scalar multiple of the stress deviator. 

With a thus determined from (84), the remaining equations (76) and (83) de- 
termine the response of Hm as for example discussed in Bemardini and Pence 
(2002b). This response is generally hysteretic due to the effect of the right side of 
(83). If, however, the right side of (83) vanishes, then Hm can be given explicitly 
in terms of the current values of S and as 



Ha = ?a(S, 1^), Hm = Hm(S, #), 




238 



Davide Bemardini and Thomas J. Pence 



with 



?a(S, 2 ~ AQp^ ^ ° ^(^)» 

?m(S, - &o) - o A(S). 

More generally, the inequality constraints associated with (3) and (7) give 



a = 



?A = 



= 



iA(S) ifA(S)-A(S)<i(pg*)2, 

^-^A(S) ifte-A(S)>l(p,*)^ 

0 if?4(S,i?)<0, 

?a(S,i?) ifO<?A(S,i?)<l, 

,1 if?A(S, I?) > 1, 

0 if?M(S, &) < 0, 

?m(S, 2?) ifO<?M(S,i?)< 1, 

1 if|A/(S, t?) > 1, 



( 86 ) 



thus providing stops to the phase fraction evolution and a limitation on the magnitude 
(but not the direction) of a as otherwise given by (84), (85). The explicit determination 
of a, and in terms of the current S and given by (86) is consistent with 
standard Austenite-Martensite equilibrium phase diagrams for hysteresis-free phase 
mixing. Consideration of a nonzero right side to (83) then describes hysteretic phase 
fraction evolution about this Gibbs equilibrium. 



9.11. Conclusions 

The framework provided by multifield theory combined with a suitable constitutive 
prescription for the entropy production is well suited for modeling the macroscopic 
themomechanical behavior of shape memory materials. An appropriate multifield 
framework involves two scalar microstructural fields describing the phase partitioning 
and a second-order tensor microstructural field describing Martensite orientation. 
Together this generates a description for pseudoelasticity, shape memory effect, and 
reorientation of Martensitic variants at both low and high temperature. The present 
work has focused mainly on the discussion of general aspects of the theory paying 
special attention to the treatment of balance equations and to the exploitation of the 
constitutive structure. A first specific example involving an explicit prescription for 
the constitutive functions has been presented with reference to linearized kinematics. 
Further models obtained from richer expressions of the constitutive functions, as well 
as numerical simulations aimed so as to reproduce the experimentally observed spatial 
localization of the phase transformations, will be presented elsewhere. 
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10. Balance at a Junction among Coherent 
Interfaces in Materials with Substructure 

Gianfranco Capriz* and Paolo Maria Mariano^ 



Abstract. The junction among coherent interfaces separating phases in continua 
exhibiting material substructure is modeled as a virtual bar endowed by its own 
measures of interaction (macro- and microstresses and self-forces); balance equa- 
tions at the junction are derived within a multifield theory. To involve bodies with 
substructure is not an idle formal exercise; strong anchoring conditions at the 
boundary rarely allow the existence of smooth solutions. Thus interfaces are more 
frequent and junctions inevitable if overall substructural order need be preserved. 
Besides, our development frames together old remarks (e.g., a remark by Gibbs) 
and new suggestions. Also we have the chance, here, to return to some nontrivial 
open issues of the general theory. 



10.1. Introduction 

We derive here balance equations at a junction among coherent interfaces in bodies 
exhibiting material substructure. The junction is a line in three dimensions along 
which different phases meet; it is endowed by an excess energy (line free energy) 
which may be evaluated by experiments (see Gibbs, 1948). 

The substructure within the material is treated here as in (Capriz, 1985, 1989): 
We associate with each material patch P the position x of its center of mass and 
some information about its texture, the latter through an order parameter chosen as 
an element v of an appropriate finite-dimensional differentiable manifold A4. The 
geometric properties of M determine the nature of each specific substructure. Sub- 
structural interactions provide extra power, are balanced, and are typically associated 
with V and, perhaps, its co variant gradient; their rendering depends on the geometrical 
properties of M: If M admits a physically significant connection, then it is possi- 
ble to resort to appropriate distinct quantities called conventionally microstress and 
self-force to represent substructural interactions. 

Where discontinuity surfaces are present, even across one another at a junction, 
the picture becomes more complex because such surfaces and lines may possess their 
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own energy. Thus a deep ambiguity between form and substance arises. On the one 
hand, the surfaces model thin transition layers involving a crowd of atoms; the layers 
are not “intrinsic” in that they can “reform” so as to adjust to given side conditions, 
the layers comprising other atoms. In mathematical terms the discontinuity surfaces 
must be interpreted as “free internal boundaries”; in particular, balance laws must be 
of the local type and pointwise balances of interactions at the interfaces and at the 
possible junction must be considered (or, better, deduced). 

It is for us to conceive a consistent and adequate model of the physical circum- 
stances, hopefully summoning only easily available tools. Experimental evidence 
seems to show that the membrane model is satisfactory for the interfaces; would then 
the classical theory of strings offer adequate hints to describe the behavior of the 
junction? In the standard case we need a rod to hold firm the margin of a real mem- 
brane, but here, of course, the junction is the margin of a number of “membranes” 
and it could move and adjust its shape to respond to particular circumstances. Thus 
to answer the question one need refer back to experimental evidence. Besides, the 
involvement of substructures begs the question of what is then a “membrane,” a 
“string,” or a “rod”; thus we must provide first an answer still to another question. 



10.2. The standard instance 

10.2.1. Notation 

To focus on some ideas essential in the study of a junction, let us start by considering 
first simple materials', then the body B is identified with its reference placement B 
which may be assumed to be a fit region. 

Given a frame of reference {X} for B, a sufficiently smooth bijection x(X) 
shows the present place of P at X in S; F denotes, as usual, the placement gradient 
Vx (assumed to be such that detF > 0). 

Although we always have in mind the present setting of B and seek balances 
there, we write equations in terms of X as it belongs to B and consider images of 
surfaces presently occurring, as they are bijected within B. We use standard notation, 
briefly recalled below for convenience. 

Within B let N oriented intersecting surfaces Tik (k = 1, . . . , A) be selected 
by the assignment of N smooth functions fk(X) so that 

= {X e clB, fk(X) = 0}, k = h...,N; (1) 

the orientation of each Ejt is given through the normal vector field = V/^/| V/j^|. 
Notice that, later, we do not exclude the influence of time t, but for simplicity, we do 
not mark that dependence here. 

Given any smooth vector field a on T>k, we denote its surface gradient by 
The curvature tensor L is the opposite of the surface gradient of the normal vector 
field L = — Vsn, while the overall curvature K is the opposite of the trace of L: 
JC = tr(Vsn) = Divs n. 
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Given a field e that is continuous on and admits the limits (s > 0) 

— lim e(X ± enk), X e (2) 

£— >0 

we use the notation [e]fc for the jump e'^ — of e at Ejt and (e)k for the average 
^(ek + ek), with e taking values in a linear space. Of course, given two fields, say, e\ 
and 62 , then, with some significance assigned to the product, 

[^\^ 2 ]k = [e\]k{e 2 )k + {ei)k[e 2 \k- (3) 

If the field F has the properties just mentioned for e, then the surface Ejt is 
coherent if 



[F]jt(I - njt (8) = 0, X € Ejt, (4) 

where I is the unit tensor while (I — (g) njt) is the projector on Ejt. 

The condition of coherence is presumed satisfied from now on, and we find it 
convenient to have a notation for the surface gradient 

Fit = Vex = (F)it(I-njt 0njt), X € (5) 

Our interest here centers on the case when the surfaces Ejt all “fan out” of a 
single “junction” 3. We assume that d be a simple connected curve and that, during 
the deformative process, it remains a simple curve; i.e., we impose the requirement of 
stability of the junction. We assume also that 3 be smooth so that it can be represented 
by a smooth function r(S) such that |r,s| = 1; S € [0, /]: / the length of 3. If r,s and 
rss are linearly independent, then in each point of d a unique triple is defined: the 
tangent vector t = r,s, the principal normal n, and the binormal. The curvature vector 
f) of d and the scalar curvature R can be obtained as 1) = t,s = r,ss, ^ = 1) • n. In any 
case, the plane normal to t is clearly available. 

We render explicit the suggestion that the surfaces Ejt “fan out” of 3 by requiring 
that (i) 3 is the margin of all Ejt and (ii) the k limits limx^r(s) njt(X) exist at all r(s) 
and their values, say, njfc(r(s)) = nk(s), at each r(s) are all different. 

We also require the existence for all fields e of interest of limx->;j e^(X,t), 
X G Ejt. In particular, if Fk^k-hi is the gradient of deformation in the region of 

B between the surfaces Ejt and E^+i , we presume the existence of the limit F^ = 

limx->aFit,^+i(X),X G Bk^k-hi- 

The coherence condition (4) implies that - F^^^ ^ f^^ 

/ = 2, . . . , — 2 and k 1 < N — 1. 

10.2.2. Balances at the junction 

In the present subsection the following measures of interaction are involved: the 
Piola-Kirchhoff stress tensor T in the bulk, the surface stress T on the discontinuity 
surfaces, the line force h, and the line moment m along the junction. 

The line force h is a vector at each point of 3 and can be decomposed into the 
sum of the line tension at, with a = h • t, and the line shear g = (I — 1 0 t)h. 
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T is assumed to be smooth outside the surfaces and to suffer jumps along the 
surfaces For each Tjt is such that TkUk = 0. On each Ejt the corresponding 
Tjt is supposed to be continuous up to the junction. 

Cauchy’s equations of balance apply in B \ (Ua:=i 

pf + DivT = 0, (6) 

where, as in standard notation, p is the density of mass and f represents body force per 
unit mass in the reference placement; (7) is the symmetry requirement of the Cauchy 
stress (detF)“^TF^. A standard “pillbox” reasoning leads to pointwise balances on 
each Ejt, 



[T]knk + Divs Tjt = 0, (8) 

T,F[ = F,T[ (9) 

(see Angenent and Gurtin, 1989), where (9) is again a symmetry requirement but for 
the current surface stress 

To obtain the balance conditions along 3 consider the tube b/?\br wrapped 
around 3. The tube is the difference between two coaxial “curved” cylinders b/? and 
br scanned by discs Dr and Dr of radii R and r, respectively, (r < R) everywhere 
normal to 3 and with center on that piece of 3, where S 6 [Si, S 2 ]. R is supposed to 
be small enough so that the disc Dr for any chosen value S of S does not intersect 
any one of the discs corresponding to other values of S different from S. 

For the balance of forces on b/? it seems reasonable to require that 

f pf + /* + ^ f Tjtiiit + h|r(s2) “ h|r(si) = 0 (10) 

JbR J* j^„l 

if we interpret h as the traction on a cross-section of the junction and as the limit 

for r 0 of the surface integral over 9(b/?/br) l^ss the two circles at S = Si and 
S = S 2 , where njt is the normal to the boundary of 9 b/? fl Ej^ within the tangent plane 
to Ejt. Applying Gauss’s theorem to the surface integral, we obtain 



/* Tm = f DiwT + Yf [T]jtn^ + lim f Tm. (11) 

J* JbR ^iJbRni:k r^^Jdbr 

We assume that 

f |Tm| (12) 

Jdbr 

is hounded as r 0 so that we have limr-»o Tm = lim^-^o fsPr 
Taking into account also the assumption of continuity of Tjt up to the junction. Gauss 
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theorem allows us to change (10) into 




+ DivT) + 



^ f 
k=\ 



([T]jtn)t 4-DivE 




N N 



k=\ 



= 0 . 

(13) 



Taking into account (6) and (8) and exploiting the arbitrariness of the interval [Si , S2], 
we get the pointwise balance 



N 

hs + ^ = 0 along 3. (14) 

k=\ 

Circumstantial evidence similar to that subsumed in stating the balance equation 
(10) suggests the following condition of balance of moments: 

I (x - xo) X pf + MX - Xo) X Tm + 

t/b/? J* 

X Tkhk + (x(r(S2)) - Xo) X h|r(s2) - (x(r(Si)) - xo) 
xh|r(si) + m(S2)-m(Si) = 0 . 

By applying the Gauss theorem and by taking into account (6)-(9) and (13), we obtain 

txh + ms = 0 along 3. (16) 

again thanks to the arbitrariness of the interval [Si, S2]. 

Balance equations at junctions in three-dimensional Cauchy continuahave been 
derived in (Simha and Batthacharya, 2000). The differences between their final results 
and ours must be sorted out by experiments. They do not exclude a singular behavior 
of T at the junction leading to a nonnull limit for the integral Tm. Their omission 

of line moments along the junction may be justified by the type of dependence of the 
energy on the shape of the junction. Contrarily and naively one may follow this line 
of thought: When one endows the interfaces with surface stress, one imagines them 
to behave as membranes; when one stretches a membrane by fixing it at a prescribed 
boundary (as for some strange drum) one needs the boundary to behave like a beam, 
which may be the case of junctions that are margins of interfaces. Leaving the matter 
open, we adopted a prudent stance and admitted the possibility that the junction have 
the capacity of a rod. 



^ r 

V / (X - Xo) 

*/9b^nEjt 



(15) 



10.3. Considering substructures 

When the material patches exhibit a substructure, its descriptors need be introduced 
and relevant interactions balanced; new relations at the junction are then obtained. 
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10.3.1. Placements 

Here we model the substructure through an order parameter v chosen as element of 
an appropriate finite-dimensional differentiable manifold M whose tangent space at 
V is indicated with T^M. Two fields are defined on B, namely, the smooth bijection 

(17) 

representing the classical placement field mentioned in the previous section and the 
order parameter field 



v:B->M. (18) 

Examples can be quoted that are apparently very different from one another 
but in fact share the basic framework. Nematic liquid crystals are characterized by 
directions and a realization of M for them could be the projective plane or the unit 
sphere in with identification of antipodes. A single parameter v taking values in 
[0, 1] may describe porous solids or two-phase materials. Elements of R^ are chosen 
to declare the kinematic perturbation induced on the displacement field by diffused 
microcracks. Symmetric second-order tensors may figure out the macroscopic ef- 
fects of interacting macromolecules or partially ordered nematics. Skew-symmetric 
second-order tensors occur in theories of composites with diffused stiff fibers. Dif- 
ferent phases of a body could be characterized by values of v belonging to specific 
disjoint subsets of M. 

The geometrical properties of M determine the characteristic features of each 
special model. The substructural kinetic energy (if any) induces a metric on M; 
though the metric may be used to induce a connection, such a connection may be 
physically significant or not. Only the existence on A4 of a connection with physical 
meaning allows one to represent substructural interactions through appropriate dis- 
tinct quantities termed conventionally microstress and self-force. The alternative must 
be spelled out for the correct representation of the interactions. (See Capriz, 1985, 
1989, 2001 for a general framework for multifield theories; see also Mariano, 2001 
for additional results and a detailed list of references.) Here we restrict developments 
to cases where a physically significant connection on M exists and Vv has covariant 
meaning. In what follows v is assumed to be smooth on S/ U 

Motions are represented through time-parametrized families of fields and Vt 
such that the position of the point X e S at the instant t e [0,d] cR and the relevant 
value of the order parameter are given by 

x,(X)=x(X,0, Vr(X) = v(X,0, (19) 

where x^ and V/ are considered to be at least twice differentiable with respect to time. 
Thus velocities at X and t are given by 

x(X,0, v(X,0, (20) 

and for convenience we will indicate in the following with Vel the set of all possible 
pairs of fields (x, v) at (x, v). 
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Let us consider a time-parametrized family of proper orthogonal tensors Q(0» 
with Q e SO(3) at each t, and the associated family of corresponding vectors q(t) 
such that Q(0 = exp(eq(0), © being Ricci’s alternator. 

If we take Q(t) as the descriptor of a rigid body motion, the rigid velocity field 
xr associated with the placement field is given by 

X/? =c(0 + q X (x-xo), (21) 

where c(t) is the translation velocity, xq is some fixed point, and q(f) is the rotational 
velocity. 

V may sense a change of observer. After a rigid change of its frame described 
by Q(0 an external observer measures a new value Vq of the order parameter related 
to the value v evaluated before the change of frame by 

Vq = v + ^q + o(|q|), ^=-^|q=0, (22) 

where ^(v), an operator mapping vectors of into elements of the tangent space 
of is the infinitesimal generator of the action of SO (3) on M. generated by the 
element q x of the Lie algebra so (3): 

Vq = ^q. (23) 

Two different observers moving independently read two different pairs of ve- 
locity fields, (x, v) and (x*, v*), respectively, 

X* = X c(t) -I- q X (x - Xo), (24) 

V* = V + >fq (25) 

if c and q characterize the relative motion. 

10.3.2. Power and balance of standard and substructural interactions in the 
bulk 

Balance equations for substructural interactions are proposed and justified variously. 
Whereas in standard continua we know how the requirements of balance of forces and 
moments may be written directly and our imagination is sufficiently far-reaching to 
cover, without qualms, even rather unusual situations, when nonstandard interactions 
intervene, as in the presence of substructures, the path to the goal is much stonier. Then 
a smoother bypass begins by associating a measure of interaction to the time rate of 
each descriptor of the placement of the body and thus by coming to an educated guess 
of the virtual power of all external actions on any subbody and then proceeds by 
requiring its invariance under changes of observer (not only those due to the changes 
of frame in Euclidean space but also those, physically permissible, due to “change of 
frame” on M). 

Stretching analogies (fully conscious of limits of validity of our proposal and 
of its possible falsifiability in border cases), we take (omitting from now on the 




250 



Gianfranco Capriz and Paolo Maria Mariano 



dependence on X) 

= / (pf • X + • v) + f (Tm • X + 5m • v), (26) 

Jb Jdb 

where represents external bulk interactions acting on the substructure per unit 
mass and 5 is the microstress, and we set down the following fundamental axiom of 
invariance of power. 

Axiom. is invariant under any change of observer, in primis under the changes 

ruled by (24) and (25). 

To satisfy this foremost condition it is necessary and sufficient that, for all c and 
q and for all choices of the subbody b, 



"(//'■'IS 

+ ^ • {^j ((x - xo) X pf + A^pP) + j ((x - xo) X Tm + ^^5m)^ 



The arbitrariness of c and q implies the integral balances of forces and moments 

/ pf + / Tm = 0, (28) 

Jb Jdb 

I ((x - Xo) X pf + A^pP) + / ((x - Xo) X Tm + >t^5m) = 0. (29) 

Jb Jdb 

From (28), since b is arbitrary, the standard Cauchy equation (6) follows, whereas (7) 
is substituted by the specification of a contribution of the substructural actions to the 
skew part of Cauchy’s stress 

A^pP = eTF^ - DiviA'^S), (30) 

where e is Ricci’s permutation tensor. 

A deeper interpretation of (30) leads to the introduction of the self-force. By 
developing the divergence of A^S, we see that (30) requires (V>1^)5 - eTF^ to 
belong to the range of A^: There exists an element f of the cotangent space TfM to 
A4 at V such that 

A^^ = eTF^ - (VA^)S (31) 

and the following differential inclusion applies: 

pP — ^ + Div 5 € null space of A^ . (32) 

Thus f is determined apart from a component along the null space of A^ . Actually, 
there is a class of bodies for which the indetermination is obviously avoided. In fact. 



(range of A)'^ = null space of A^ 



(33) 
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(when a notion of orthogonality is available and (range of A)-^ is the set of “vectors” 
of T*M. orthogonal to all vectors in (range of A)), then whenever the range of A 
coincides with the whole tangent space of the null space of A^ is reduced to the 
singleton {0} and thus the inclusion (32) implies the equation 

PP — ^ Div 5 = 0. (34) 

That conditions are not so trivially simple, in general, is shown by the following 
example. Take as order parameter a stretchable vector d so that A is the second-order 
tensor ed; in such a case the tangent space comprises also vectors parallel to d (in 
pure stretching), whereas the range of A comprises only vectors normal to d. 

A way out of the impasse is suggested by the following remarks. Given coordi- 
nate systems in and on A4, ^ is expressed by a (3 x dim Af )-matrix of characteristic 

k, say (k < 3, k < dim A4) and the null space of A^ is a subspace of T*M with 
dimension dim M — k. 

Actually one can choose the coordinates on M so that any vector of T*M with 
the last dim M—k components null belongs to the null space of A^ . Hence a vector 
of TyM with the last dim A4 -- k components null gives rise to a virtual power 
that is observer independent, whichever choice is made for its first k components. 
Now, requiring that the virtual power (26) be invariant under changes of v allowed by 
that arbitrariness, stiiriarge when A: > 0, one eliminates the residual indetermination 
inherent in (32) and that inclusion collapses into the balance equations (34). 

An illuminating example is provided by stretchable vectors mentioned above: 
Instead of the components of the vector d, we suggest taking as coordinates two 
independent quantities: the unit vector d|d|~^ and ld|. For d|d|~^ one follows de- 
velopments as for nematic liquid crystals, where f is known only up to a vector 
parallel to d. For the length |d| (or any strictly increasing function of that length, 
which is not affected by rotations), the dual variable is the stretching action otherwise 
undetermined. 



10.3.3. Rods with substructure 

As a preliminary, we find it expedient to analyze the equilibrium of rods with sub- 
structure because this seems to be an adequate model for our picture of the junction. 
To this end, consider a rod as the region scanned by translating a regular plane region 
A along a regular simple curve r(£), with I the arc-length parametrization I e [0, £], 
while maintaining the center of A on the curve and A always orthogonal to the tangent 
t. We indicate with the normal at the bases A(0) and A(i) and with the normal 
along the lateral boundary dA x [0,1]. 

The theory of rods, in which a peculiarly thin three-dimensional body is the 
object, necessarily involves totals of actions over the body and on separate portions 
of the boundary; in particular, totals over cross-sections and first moments over those 
sections, too. Thus in the classical case, one is led to tractions h and bending and 
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twisting moments (adding up to a vector m) and to balance equations 

h£ + b = 0, (35) 

m£+txh + /x = 0, (36) 

where b is the external force per unit length (bulk action plus traction on the side 
boundary) and fji is the external moment per unit length. 

When rods with substructure are object of study, the peculiarity of the represen- 
tation of microactions through covariant vectors in the cotangent spaces is an obstacle 
to the evaluation of totals. One must face, first of all, the need of a deeper look at 
the properties of the manifold M embodying all values of the substructure. We are 
mainly concerned here with problems in statics; however, to justify the assumption 
that M be Riemannian, it is expedient to take heed of the kinetic energy associated 
with rates of change of the substructure, a quadratic form with coefficients that pro- 
vide a metric for M. Then a celebrated theorem of Nash (1954) assures us that a 
-isometric embedding of the Riemannian manifold M into an Euclidean space £ 
of dimension 2 dim A4 + 1 exists; some delicate points arise in connection with that 
embedding, in particular, the question of its possible insufficient regularity for all 
developments that follow below. Using Euclidean spaces of higher dimension, Nash 
has proved that C^-smoothness (k > 1) can be achieved (Nash, 1956). However, 
optimistically we brush aside these preoccupations and proceed as though no further 
obstacles intervene. The widening of the dimension, the nonuniqueness of the em- 
bedding, the physical meaning of the possible lack of smoothness, all must be placed 
against experimental evidence case by case, and for each, hopefully, a justification 
must be found. Convenience and simplicity may push one in accepting less regular 
embeddings in lower-dimensional Euclidean space; one must be aware, however, that 
one may thus forecast transitions less smooth than occur in reality. 

We will work below using the same notation for variables and fields as already 
introduced, as though we were still on M (now a hypersurface in a convenient £) 
rather than in £, though, as a modest reminder, we use bold letters to denote the 
extended versions of the same quantities (exception is made for <S with a slight abuse 
of notation). Perhaps within the domain of each separate phase the values fall only 
on M, but to discuss transitions and give values to phase differences, we must allow 
ourselves to exit from M and move freely in £. Such is the case, for instance, of 
the so-called chevron transition in nematic liquid crystals: Even if we consider such 
transitions sudden (i.e., first order), the jump is represented by an isotropic state (not 
on M, now the manifold of directions). If we take the transition to be through a 
thin layer (and smooth within it, i.e., second order), it can be described only in the 
embedding space (of symmetric, traceless tensors). 

Another totally different reason for widening the dimension of the spaces where 
states and interactions are placed is the possible wrinkling of the interface. Then, a 
convenient mathematical model may consist in the smooth surface obtained through 
an “ironing out” the wrinkles, on which added variables (e.g., a roughness tensor) 
account for the wrinkles; the latter, even in a classical elastic solid for instance, allow 
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the intrusion of distributed torques otherwise obviously absent. However, a general 
study of such a class of interfaces seems to be missing as yet. 

We denote by p the interactions per unit length on the substructure which are 
the sum of the volume actions due to pP per unit volume and the surface actions per 
unit area on the lateral skin measured by Sni and by p the internal force per unit 
length. We also set 






= f A^Sub. 

Ja 



The second-order tensor A^S applied to gives rise to a standard couple. Using the 
same notation as before, we return to the integral balances (28) and (29) applied to a 
generic cutting of the rod (£ € [€i, € 2 ] £ [0, 1]) and accordingly write the integral 
balances of forces and couples, respectively, as 



h2 — hi + 



ri2 _ 

/ b = 0, 

Jii 



(x(^2) - Xo) X h 2 - (x(£i) - Xo) X hi + m 2 - mi 



rh _ _ rti 

I {(x - Xo) X h + ix) + A^si - A^s\ + rfi 2 - tfJi + / A^ p = 0. 

JU Jti 



The arbitrariness of [£i , £ 2 ] implies both (35) and 

t X h + /X + (m + + A'^^s + A'''(s,t + /3) = 0. 

We then see the need of the existence of a self-force p' such that 

A^p' = -t X I - /r - A^^s - (m + 

This allows us to write (40) as 



5 £ -|- pp — p = 0 



under the condition that 



p = p' + p" and A^p" = 0. 



The indetermination in (43) may be eliminated by reasoning akin to that sug- 
gested at the end of Section 10.3.2. However, if the matter had been already investi- 
gated in three dimensions, then the integration over the cylindrical body of (34) would 
settle the question. Actually, complications may arise if wrinkling were present, as 
mentioned above; then p may not be simply linked to f . In conclusion, on a rod, the 
material substructure generates basically an increment rfl of the standard torque. The 
standard equation (36) is evidently a special case that one reaches in the absence of 
substructure. 

Analogies and differences with the theory of filiform bodies with substructure 
(Capriz and Cohen, 1988) are evident: The basic difference is the presence here of 
the term (m + m),£ collecting the standard moment m and the extra standard moment 
m generated by the microstress (together with the applied couples pi). 
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10.3.4. Membranes with substructure 

Let E be a compact regular surface embedded in the three-dimensional Euclidean 
space given by 

i: = {X 6 s.t. /(X) = 0} (44) 

with /(•) a smooth function 'm £^. S is oriented by the normal vector field n = 
^nd has a regular boundary 9E endowed with normal n belonging to the 
tangent plane of E at X. 

The surface E is a natural geometric model of a material membrane, one pre- 
sumes the existence of a surface stress T (say, a Piola-Kirchhoff surface stress) such 
that Tn = 0 and thus associates tractions tangential to E with tangent vectors on E. 

If b is the body forces per unit area and p the density of mass per unit area, the 
balance of standard forces becomes 



Divj; T + pb = 0 on E, (45) 

while the balance of torques implies 

TF^ = FT^, (46) 

where F(X) is the surface deformation gradient Vj;X(X) as in Section 10 . 2 . 

To consider membranes with substructure, we first associate an element of M 
with each point of E, then consider in addition to T a surface microstress S such 
that Sn = 0. S associates elements of the cotangent space of M with tangent vectors 
on E. 

We indicate with ^ the body interactions per unit area on the substructure. We 
consider also an arbitrary subsurface E* of E and write the balances of forces and 
torques on E* as follows: 



I ((x - xo) X pb + pA^^) + 

Jt* 



[_ ph+ [ Tii = 0 , (47) 

Jt* Jat* 

f ((x - Xo) X Tn + >l^Sn) = 0 . (48) 

Jdt* 



By (47), the arbitrariness of E* again implies the pointwise balance (45), while by 
(47) we obtain 



eTF^ - (Vj,^^)S - A^iDivf. S + )3) = 0, 

which suggests the existence of a self-force 3 ' such that 



= eTF^ - (Vj^^)S. 



Consequently, (49) becomes 



Div^ S + p)S - 3 = 0 



(49) 

(50) 

(51) 



with 



i = }' + f and A^i" = 0 . 



(52) 
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Remarks at the end of Section 10.3.2 apply and allow us to recognize the special 
nature of 3 " in each relevant case. Of course, had we already decided how to avoid 
the indetermination in three dimensional bodies, we would only need to obtain again 
the final balance equation by integration of (34). 



10.3.5. Balance of standard and substructural interactions at a discontinuity 
surface 



We turn now to the topics discussed in Section 10.3.2 and deduce the balance equations 
at each discontinuity surface taking into account the results in Section 10.3.4. We 
do not consider also surface body interactions like b and ^ 0 , but assume the order 
parameter field continuous across the interface. 

Let bs be an arbitrary subbody crossing one interface E and far from the others. 
The subbody bs is considered as a “pillbox” around E: It is such that the intersection 
between the boundary Obs of bs and E is a curve 8 bs H E. 

Taking into account the developments in Section 10.3.4, the integral balances 
of forces (28) and moments (29) become on bs, respectively. 



f pf+ / Tm+ / Tn+ / b = 0, (53) 

t/bs Jdbj: Jdb-zHYi t/bsOE 



I ((x - xo) X pf + ^ pP) + I ((x - xo) X Tm + A^Sm) 
Jby: Jdb'£, 



'bj: 

+ 



I ((x - Xo) X Tn + ^^Sn) + f ((x - xo) x b + A^ p) = 0. 
Jdb'zD'Z t/bsflE 



By shrinking bi; at the interface, we obtain 

f /Of 0, f ((x - Xo) X (Of + A^pP) 0. 

Jbj: Jbj: 

asbi;->bi:nE, because the integrands are continuous on 6, and 



f I [T]n, 

JdbY, t/bsHE 



(54) 



(55) 



(56) 



/ ((x — Xo) X Tm + ^^5m) 

JdbT, 



f (X - xo) X [T]n + A^ [5]n (57) 

•/ b sHE 



as bi: -> bi: n E. Thanks to the arbitrariness of bs and the use of the Gauss theorem, 
we obtain 



b -f [T]n + Divs T = 0 on E 



(58) 



from (53) and 

A^p + A^[S]n = eTF'^-DivY.(A^S) on E (59) 

from (54). By developing the divergence of we note that (58) requires the term 
(Vs, 4 ^)S — eTF^ to belong to the null space of A^; i.e., there exists an element 3 
of T*M at each v such that 

= eTF^ - (Ve^^)S onE. 



(60) 
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Note that equation (60) indicates that the presence of material substructure renders 
nonsymmetric the surface Cauchy stress tensor as well as it renders nonsymmetric 
the standard Cauchy stress TF^, as stated by equation (31). 

Previous conditions imply the following differential inclusion 

+ [<S]n — 3 + Divs S € null space of on E, (61) 

which reduces to 

P + [S]n - 3 ' + DivE S = - 3 " on E , (62) 

with 

3 = 3 ' + 3 " and ^V = 0, (63) 

an indetermination that can be avoided by a pillbox argument based directly on (33), 
with the usual proviso of the absence of wrinkling. 

Even in this case, remarks of Section 10.3.2 apply and allow us to recognize in 
each special case the nature of 3 ". (The measures of surface substructural interactions, 
i.e., surface microstresses and surface self-forces, have been introduced in Mariano 
( 2000 , 2001 ), and the related balances have been deduced there.) 



10.3.6. Balance of standard and substructural interactions at the junction 

To obtain the balances of standard and substructural interactions at the junction, we 
consider the subbody b/? described in Section 10.2.2, coinciding with a cylinder 
wrapped around the junction. As in Section 10.2.2, if we take an arbitrary point of the 
piece of the junction in b/? and consider the section of b/?, obtained cutting b/? with a 
plane orthogonal to the tangent at the junction at the same point, we find a disc Dr of 
diameter R. (For geometrical details, see Section 10.2.2.) The piece of 3 included in 
b/? is a curve with arc-length parametrization S € [Si , S 2 ]: the points r(Si) and r(S 2 ) 
are the intersections of 3 with the boundary 8 b/? of b/? . 

We do not consider here line body interactions p on the substructure at the 
junction and write for b/? the integral balances of forces and torques as 

f pt+ f Tm + V f + f hs+ f b = 0, (64) 

Jbp JdbR /. 1 t/9bffnEt Jsi Jbi 



I ((x - xo) X pf + A^pP) + f ((x - xo) X Tm + A^Sm) 

JbR Jdbp 



b^ns 



N 



+ E / “ *o) X TkAk + A^Skhk) 

r~, Jab 



•'9b/?nz:jt 



(65) 



N 



+ (m + tfi)(S2) - (m + m)(Si) + ^ f ((x - xo) x ii + A^^) 

+ (x - xo) X h(S2) - (X - xo) X h(Si) + «4^Slr(S2) - >l^S|r(Si) = 0, 



which are the counterparts of (28) and (29) and account for the results in Sec- 
tions 10.3.3 and 10.3.4. 
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To obtain the integral balances of standard and substructural interactions at 
the junction, we take the limit /? -> 0 in (64) and (65) by assuming not only the 
boundedness of (12) but also that 



f l<5m| 

JdbR 



( 66 ) 



is bounded as /? -> 0. This implies that 



lim f <Sm = f lim f Sm = 0. 
^-^^JdbR Jsi 



(67) 



As a consequence of (12), we find (as in Section 10.2) that (64) reduces to 






( 68 ) 



as /? -> 0, which is the integral balance of forces at the junction. The arbitrariness 
of the interval [S\ , S 2 ] leads to the pointwise balance (14), which we rewrite here as 



N 



h s + ^ Tjtn^t = 0 along 3. 



(69) 



k=\ 



We apply the same procedure to (65) and, thanks to (68), obtain the integral 
balance 



X h + (m + m),s + (^^s),s + ^ 

The arbitrariness of the interval [Si , S 2 ] implies 

N 

t X h + (m + fTi),s H" “I" “f" ^ 



k=i 



which suggests the existence of a self-force p such that 

-A^p = t X h + (ITH- m),s + .4 gS. 
This allows one to write (72) as 



•4^ ^s,s — p + ^ = 0, 



(70) 



(71) 



(72) 



(73) 



which implies the differential inclusion 

N 

s,s — P + ^ Stiii e null space of 



k=l 



(74) 
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With the help of the developments in Sections 10.3.2 and 10.3.5, we see that in the 
case in which A covers at each v the whole tangent space of Af , the differential 
inclusion (74) reduces to 



N 

5,s - p + ^ = 0 along 3. (75) 

k=i 



When A(v) does not cover the whole tangent space of M at v, (74) implies that 

N 

^.s - P + ^ = 0 along d, (76) 

k=l 



where 



p = p' + p" and >lV = 0. (77) 

Had we argued directly on the basis of (62), the indetermination would not have 
appeared. 



10.4. Constitutive restrictions 



All measures of interaction introduced so far are state functions; for an ample class 
of bodies, they may be related to the expression of their free energy. It has become 
fashionable for this purpose to exploit a mechanical version of the second law of 
thermodynamics. We are aware of the disdain accruing from the overwhelming resort 
to developments a la Coleman and Noll (1963) and of the disputes attending thereof 
(to the point of accusing those authors of having revived a long-discredited phlogis- 
ton conjecture). We realize also that a satisfactory and general thermodynamics of 
continua with substructure is not yet available. However, the corollaries we draw are 
simple and elegant and may even find alternative justifications. 

The version of the second law that we exploit enjoins that for a subbody b/?, 
d 

— {free energy of b^} - < 0. (78) 

We consider bulk (^), surface (0), and junction (co) free energy densities such in a 
way that the free energy of a subbody b/? around the junction is given by 



{free energy of b/?} 



/ 

JbR 



^ r 



k=\ 



<Pk + 




(79) 



Once explicit expressions of the free energy densities have been selected, we can 
derive restrictions on possible constitutive expressions for the measures of interaction. 

First we consider a subbody b far from the interfaces. With reference to b, we 
write the mechanical dissipation inequality as 

-t: f f — f iP^ ■ X + pfi ■ v) — f (Tm • X + <Sm • v) < 0. 

at Jb Jb Jdb 



( 80 ) 
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By using the Gauss theorem on the last integral in (80) and taking into account the 
balances (6) and (34), we may rewrite (80) as 



We consider a constitutive structure of the bulk free energy of the form 

= )^(F, V, Vv), 



(81) 



(82) 



and presume that the measures of interaction depend on the same objects entering the 
constitutive list of only. By developing the time derivative in (81), we obtain 

f ((9fV^ - T) • F + (9vV^ - O • ^ + (9vvV^ - S) • Vv) < 0. (83) 

Jh 

In principle, given any state (F, v, Vv) of the material elements in the bulk, we may 
choose arbitrarily velocity fields (F, v, Vv) from (F, v, Vv). Consequently, as b is 
arbitrary (but taken away the discontinuity surfaces and the junctions) and (83) is 
linear in the velocity fields, to satisfy the inequality it is necessary that 



T = 9fV^(F, V, Vv), 


(84) 


C = 9vV^(F, V, Vv), 


(85) 


S = 9vvV'‘(F, V, Vv). 


(86) 



For subbodies bi: around the kih discontinuity surface, like the “pillbox” adopted 
in Section 10.3.5, we write the mechanical dissipation inequality as 



4-( f i^+ f <p)- f (pt ■'}^ + pP ■ V) - f (Tm • X + 5m • v) 

t/bsHE / t/bs Jdhj^ 



-i 



(Tn • 4- Sn 



abEfis 



v^) - /* (b X 
•/ bsDE 



X^ + P-V^) < 0. 



(87) 



If we adopt the constitutive relation (82) and a constitutive structure for the 
surface free energy of the form 



4> = $(F, v=*=,N), 



( 88 ) 



where N = V^v = (Vv>(I — n 0 n), and presume that the surface measures of 
interactions depend only on the entries of 0, by developing the time derivatives of 0 
and 0 and using (84)-(86), we reduce the inequality (87) to 



f ((9p0 - T) • F + (9y±0 - 3) • v"^ + (9 n0 - S) • N) 

7bi:nE 

- f ((T)n.[x] + (5)n.[v])<0. 



(89) 



Even in this case, for any state (F, v^ , N) of any material element of the interface 
E, we may choose arbitrarily velocity fields (F, v^, N). Consequently, since (89) is 
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linear in the rates (F, N), thanks to the arbitrariness of it is necessary that for 

each kth discontinuity surface 



Tfc = 9 f*<^*(F*, N*), 


(90) 


ik = dv^ki^k, N*), 


(91) 


Sfc = 9N*</»i(F<:, V^, Nt). 


(92) 



Moreover, the arbitrariness of bx: leads to the local surface dissipation inequality 



(T)n • [x] + {S)n • [v] > 0. (93) 

Note that, as is the average (v) around E, for jVl a nonlinear manifold, (v) would 
lose significance had we not proceeded to the embedding in a linear space. If v is 
continuous across Ejt, as assumed in previous sections, then = v. 

To obtain constitutive restrictions at the junction, we consider a subbody b r 
around 3 (as we have described in Sections 10.2.2 and 10.3.6) and we write the 
mechanical dissipation inequality as 



— 

dt 





(pf • X + . v) 




• X + Sm • v) 





(94) 



- (h . x(S2) - h • x(si)) - ((m + m)(S2) • 0(S2) - (m + m)(Si) • ^(si)) 
-J2 {hk-\f + pk- i>t) - (S • v|r(s2) - s • v|r(Si)) < 0, 



where ^(s) is the rotation of the plane coincident with the plane orthogonal to 3 at 
r(S) in the reference configuration. 

If we use the balances in the bulk and at each discontinuity surface, together 
with the balance of standard and substructural interactions at the junction, namely 
equations (69) and (76), we may rewrite (94) as 



\JbR 7b/?nEjt 



(T.F 

Js\ j »/bs 



+ ^ • V + 5 • Vv) 



-/ ((T>n • [X] + (<S)n . [v]) - f (T • F;t +3 • + S • N) (95) 

/’®2 • _ ■ 

- I (^•A, + (m + m)-K + p- v + 5- (Vv)t) < 0, 

Jsi 

where A. = Ftandx- = 0,s- We use the constitutive relations (82) and (88) together with 



CO = K, V, (Vv)t) 



(96) 
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(presuming that all the line measures of interaction depend only on the entries of 
co), develop the time derivatives, and insert them in (95). The results (84)-(86) and 
(90)-(92) allow us to reduce the mechanical dissipation inequality (95) to 

f (iho) - h) • A. + (d^co - (m + m)) ■ 

M ^ (97) 

-h (dyCo — p) . V + (9(Vv)t^y — s) • (Vv)t^ > 0. 

For any state (A., /c, v, (Vv)t) of a material element of the junction, we can choose 
rates (A., /c, v, (Vv)t) arbitrarily from (X, k, v, (Vv)t). However, as (97) is linear in 
the rates involved and [ 81 , 82 ] is arbitrary, the following constitutive restrictions 
follow: 



^ = dx(0(k,K, V, (Vv)t), 


(98) 


m = dicCo{X, K, V, (Vv)t), 


(99) 


p = d^co(X, K, V, (Vv)t), 


( 100 ) 


s = 9(Vv)to)(A,, K, V, (Vv)t). 


( 101 ) 



The relations (98)-(101) are peculiar results of the present paper, together with the 
balance of substructural interactions at the junction. 

Remark 1 (independence of the free energy from the rate of the state variables). In 
principle, we can choose for the free energies expressions different from (82), ( 88 ), 
and (96). However, well-known developments would exclude such a dependence. 

Remark 2 (parabolic evolution due to dissipative phenomena). The constitutive re- 
strictions (84)-(86), (90)-(92), and (98)-(101) are peculiar of the thermodynamic 
equilibrium. Nonequilibrium parts of the measures of interaction, depending consti- 
tutively on the velocity fields, could be considered along thermodynamical processes 
of nonequilibrium. This happens, e.g., when viscosity phenomena occur and the stress 
T has the form 



T = ap^(F, V, Vv) + T'^^(F, V, Vv; F). (102) 

It may even involve the rate of the order parameter and possibly of its gradient. 

An interesting case occurs when one considers an expression analogous to (102) 
for the self-force f , namely, 

^ = a^i/r(F, V, Vv) 4- z'*^(F, V, Vv; v), (103) 



and f is such that 



A solution of (104) is 



• V < 0. 



(104) 



= Av 



(105) 
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with A an appropriate definite negative tensor (possibly scalar in some special model) 
such that 

A = A(F, V, Vv; v). (106) 

In common special cases, decomposed free energy densities of the form 

= 'i^i(F, v) + Vv) (107) 

(with (I, 0) = 0, 1 being the unit tensor) may be selected and Vv) chosen as 

\j/2 = (108) 

with b some appropriate constant and l{v) a coarse grained potential, possibly with 
double wells as in the case of solidification or solid-solid phase transitions (see, 
e.g., Penrose and Fife, 1990; Anderson et al., 2000; Nestler et al., 2000). When this 
happens, from (85) and (86) the bulk balance of substructural interactions becomes 

A(p = bAv — 9vi(v) — (109) 

which is a generalized form of Ginzburg-Landau equation. When, in fact, both A and 
V are scalars and the body does not undergo deformations, equation (109) reduces to 

A(p = bA(p — (110) 

which is the standard Ginzburg-Landau equation. 
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